Scientific  Progress

Major  Developments  in  Year  1: 

There  are  two  major  aspects  to  the  development  of  our  biosensor:  the  application  of  synthetic  biology  to  develop  novel  microbial 
sensor  strains  that  will  have  sensitive  and  specific  responses  to  critical  water  toxins,  and  the  use  of  state-of-the-art  microfluidic 
techniques  and  optical  technology  along  with  computational  biology  to  detect  and  interpret  the  signals  from  these  toxin-sensing 
organisms. 

Task  A  focuses  mainly  on  the  biological  aspect  of  this  project,  with  the  goal  of  identifying  combinations  of  cellular  signals  that 
can  be  harnessed  to  provide  specific  responses  to  the  presence  of  a  range  of  potential  water  toxins.  In  Q1,  we  searched  the
literature  to  identify  known  cellular  signaling  pathways  responsive  to  our  toxins  of  interest  and  selected  several  candidate 
promoters  from  a  variety  of  microbial  organisms.  We  designed  plasmids  with  each  of  these  promoters  driving  GFP  and 
contracted  these  sequences  to  be  constructed.  As  proof  of  principle,  we  used  microfluidics  to  test  two  such  plasmids  that  we 
built  in-house.  We  subjected  these  preliminary  sensor  strains  to  various  toxin  levels  within  a  novel  microfluidic  chip,  and  we 
observed  bright  response  signals.  In  addition  to  taking  advantage  of  known  toxin-sensitive  pathways,  we  conducted  a  program 
of  Next  Generation  Sequencing  to  greatly  expand  the  number  of  known  response  promoters  for  each  toxin.  In  Q2,  we 
developed  novel  RNA-Seq  analysis  algorithms  to  identify  specific  differentially  expressed  genes  in  our  large  data  set.  We 
located  the  promoter  regions  of  the  most  promising  differentially  expressed  genes  and  designed  sensor  circuits  based  on  them. 
In  Q3,  we  completed  the  construction  of  plasmid-based  microbial  sensor  strains  for  all  toxins,  based  on  promoters  identified  via 
literature  searches  and  RNA-Seq.  We  used  microfluidics  to  demonstrate  the  proper  induction  of  each  strain  by  various  levels  of 
the  relevant  toxin. 

Task  B  focuses  primarily  on  mechanical  sensor  development,  including  the  microfluidic  device  design,  optical  technology,  and 
computational  tools  required  to  translate  a  series  of  optical  signals  from  multiple  sensor  strains  into  a  meaningful  toxin  level 
determination.  In  Q1,  we  developed  a  novel  microfluidic  device  to  culture  and  sequentially  expose  an  array  of  sensing  strains  to 
various  levels  of  our  toxins  of  interest  over  a  period  of  several  weeks.  This  “gill”  chip  contains  tall  cell  traps  that  provide  a  bright 
fluorescent  signal  from  a  large  population  of  cells.  We  also  developed  complementary  technologies  that  allowed  us  to  mix
concentrated  media  with  a  natural  water  source  and  to  dispose  of  the  microbial  species  safely  upon  exiting  the  device.  In  Q2, 
we  partnered  with  the  Ziva  Corporation  to  develop  a  low-cost,  field-capable  optical  system  to  image  our  microfluidic  devices.  We
demonstrated  that  the  capabilities  of  this  optical  system  were  comparable  to  a  research-grade  microscope  at  low  optical  power.
To  enable  viable  long-term  storage  of  biosensor  chips,  we  developed  a  method  to  deposit  and  freeze-dry  strains  in  a  defined
array  within  our  microfluidic  device.  We  demonstrated  the  successful  revival  of  our  strains  after  four  weeks  of  room-temperature
storage  with  little  loss  of  viability.  Lastly,  we  investigated  replacing  our  GFP  reporter  with  bacterial  luciferase  toward  the  goal  of 
increasing  SNR  by  eliminating  background  autofluorescence.  We  demonstrated  that  our  arsenic  sensor  modified  in  this  manner 
shows  much  greater  sensitivity,  even  compared  to  analytical  methods  approved  by  the  EPA  for  detecting  arsenic  in  drinking 
water.  In  Q3,  we  used  strain  response  data  to  train  our  classifier  to  identify  the  presence  of  each  toxin  of  interest  in  a  water 
source  of  unknown  composition  in  real  time.  In  Q4,  we  strengthened  classifier  performance  by  acquiring  long  data  sets  (up  to 
50  days)  of  both  on-  and  off-target  toxin  exposures  within  our  microfluidic  devices. 

Task  C  aims  to  develop  technologies  required  to  make  an  inexpensive  and  robust  sensor  prototype  that  can  be  deployed  in  real 
bodies  of  water  to  continuously  monitor  for  toxin  contamination.  In  Q1,  we  developed  a  peristaltic  pumping  system  to  mix  water 
and  growth  media  on-chip  and  a  filter  to  prevent  clogs  in  the  microfluidic  device.  In  Q2,  we  developed  a  comprehensive 
software  package  to  control  fluid  flow,  image  acquisition,  and  wireless  data  transfer.  We  also  constructed  a  solar  energy 
harvesting  system  for  powering  our  device  in  an  off-grid  field  environment.  In  Q3  and  Q4,  we  completed  the  development  of  a 
functional  prototype  of  our  biosensor  for  use  in  comprehensively  testing  our  sensor  strains.  We  integrated  the  biological,  fluidic, 
and  computational  components  into  a  temperature-controlled  enclosure  containing  all  optics  and  electronics  necessary  to  draw 
a  water  sample,  acquire  images  of  the  biosensor  strains  within  our  microfluidic  device,  analyze  this  data  using  the  embedded 
classifier  algorithm,  and  transmit  the  results  via  encrypted  Wi-Fi. 

Our  final  deliverable  is  a  self-contained  water  sensor  prototype.  Attached  are  the  Year  1  results  that  demonstrate  our  ability  to 
acquire  and  analyze  data  in  real  time  to  provide  an  accurate  and  continuous  determination  of  water  quality. 


Problems/Issues: 
None  at  this  time. 


Accomplishment  of  Year  1  Goals: 

This report marks the conclusion of a one-year project that combined Next Generation
Sequencing, genetic engineering, and microfluidic technology to precisely engineer
a highly sensitive and specific biosensor platform that can continuously monitor water
supplies for the presence of chemical toxins. The following reporting update section will
describe each of the project's milestones along with related data and results demonstrating
their achievement. In this section we present an overview of the most relevant results
that demonstrate the accomplishment of the ultimate deliverable: a finalized prototype
of a biosensor for the protection of water supplies.

Figure 1: An overview of our sensor prototype. a) We currently have five replicates of the sensor
prototype running continuously in the lab. b) Sensor enclosure with front panel opened to expose internal
components. b1: electronics sub-enclosure, b2: temperature controller, b3: AC power distribution devices,
b4: tri-output DC power supply, b5: Ziva optical assembly, b6: fan/heater, b7: DC power distribution
block, b8: peristaltic pump. c) Microfluidic device being illuminated with the GFP excitation LED. d)
Image taken using Ziva optics of eight strains growing inside our microfluidic device and responding to
induction. e) Representative time series of data taken from one of the sensors. The green bar represents the
introduction of tracer dye, and the red bar represents the introduction of the inducer, in this case mercury.
Fluorescence of the mercury-specific strain rises and then falls as mercury is introduced and removed.

An overview of the sensor can be seen in Figure 1. In order to acquire the large amount
of data required to successfully train the classifier, we built five replicates of the fully
self-contained device prototype, in which the microfluidic devices are mounted and imaged
using custom-designed optics (see Milestone 6.9 below). The majority of the data for this
project was acquired using a novel microfluidic device capable of housing eight different
bacterial strains that receive media from a shared source. This enables the simultaneous
testing of the responses of eight different strains to each toxin concentration, providing
information on specific and non-specific responses, which can be used to strengthen the
classifier (Fig. 1d). We have recently expanded our design to house 18 strains and have
begun taking data with this new chip to provide larger datasets to the classifier. A detailed
CAD drawing of this device can be seen in section 6.9 below (Fig. 24). The 18 cell trapping
regions are vacuum-loaded from 18 downstream fluidic ports. Long, serpentine channels
upstream of the trapping regions (shown in red) serve as fluidic buffer zones to prevent
the cross-contamination of strains while loading. The toxin concentration is controlled
dynamically by a peristaltic pump that mixes a water/toxin combination (representing
the source water stream) with a concentrated medium stock.

Figure 2: 8-strain chip induction data for six toxins, demonstrating the sensor-specific induction of the
expected strains as well as any response from non-specific strains.
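The pump-based mixing of source water with concentrated medium stock can be illustrated with a simple dilution calculation. The sketch below is not taken from the project's control software: the function name, the flow rates, and the assumptions of ideal mixing and toxin-free medium are all hypothetical.

```python
def mixed_concentration(source_conc, source_flow, medium_flow):
    """Toxin concentration after the source stream is diluted by the medium stream.

    Assumes ideal, complete mixing and a medium stock that contains no toxin.
    Both flow rates must use the same unit; the result has the unit of source_conc.
    """
    if source_flow < 0 or medium_flow < 0:
        raise ValueError("flow rates must be non-negative")
    total_flow = source_flow + medium_flow
    if total_flow == 0:
        raise ValueError("at least one flow rate must be positive")
    return source_conc * source_flow / total_flow


# Hypothetical example: 9 parts source water to 1 part concentrated medium.
on_chip = mixed_concentration(source_conc=1.0, source_flow=9.0, medium_flow=1.0)
print(on_chip)
```

Under these assumptions, changing the ratio of the two pump rates is enough to vary the concentration seen by the cells without changing the source sample.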

Importantly, our ability to house many strains on the same chip and subject them to
the same toxin inductions enables us to look not only at toxin-specific induction but at
crosstalk as well. In Figure 2, we show a single induction time series plot for each of the
following six toxins: arsenic, cadmium, chromium(VI), copper, lead, and mercury. Each
induction plot shows the fluorescence induction ratio (calculated as (mean fluorescence
following induction)/(mean fluorescence prior to induction)) for multiple strains being
cultured in the chip, where the toxin is introduced at t = 0 and replaced by pure water
at t = 2 h. In each panel, the time series data for the "on-target" responding strain is
plotted in bold. As expected, the GFP fluorescence traces for the "on-target" responding
strains increase with induction and decrease with un-induction, with some delay due to
the cellular response machinery. Figure 2a shows strain E. coli MG1655/As7 sensitively
responding to 0.2 µM arsenic, with E. coli MG1655/Pb7 exhibiting a smaller response. We
have discovered in our experiments that strain E. coli MG1655/Pb7 is a less specific
responder that aids in toxin identification mainly through combinatorics. Figure 2b shows
strains E. coli MG1655/Cd1 and E. coli MG1655/Hg3 responding to 0.05 cadmium,
with E. coli MG1655/Pb7 exhibiting a much larger response. Figure 2c shows strains
E. coli MG1655/Cr11 and E. coli MG1655/Co7 responding to 0.5 µM chromium(VI), with
E. coli MG1655/Pb7 exhibiting an equal magnitude but more rapid response. Figure 2d
shows strain E. coli MG1655/Cu1 sensitively responding to 5 copper, with E. coli
MG1655/Pb7 exhibiting a smaller response. Figure 2e shows strain E. coli MG1655/Pb7
responding to 1.8 µM lead, with E. coli MG1655/Cd1 and E. coli MG1655/Cu1 responding
at lesser magnitudes. Finally, Figure 2f shows strain E. coli MG1655/Hg3 sensitively
responding to 0.2 µM mercury.
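The induction ratio defined above, mean fluorescence following induction divided by mean fluorescence prior to induction, can be computed as in the minimal sketch below. The function name, the sample trace, and the choice to treat all samples at or after t = 0 as "following induction" are assumptions for illustration; the report does not state which averaging windows were used.

```python
from statistics import mean


def induction_ratio(times_h, fluorescence, t_induce=0.0):
    """Mean fluorescence after induction divided by mean fluorescence before it."""
    before = [f for t, f in zip(times_h, fluorescence) if t < t_induce]
    after = [f for t, f in zip(times_h, fluorescence) if t >= t_induce]
    if not before or not after:
        raise ValueError("need samples on both sides of the induction time")
    baseline = mean(before)
    if baseline == 0:
        raise ValueError("baseline fluorescence is zero")
    return mean(after) / baseline


# Made-up trace: toxin introduced at t = 0 h, signal rises and then starts to fall.
times = [-2, -1, 0, 1, 2, 3]
signal = [100, 100, 110, 180, 220, 190]
ratio = induction_ratio(times, signal)
print(ratio)  # 1.75 for this made-up trace
```

A ratio near 1 indicates no response, so the same calculation applied to every strain on a chip gives a direct view of both on-target induction and crosstalk.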

Figure 3: 18-strain chip induction data for three toxins, demonstrating the sensor-specific induction of
B. subtilis NCIB 3610/Amm3 for ammonium and the unique combinations of responses allowing the
identification of cobalt and lead.

Within the last two weeks of the reporting period, we were able to collect toxin response
data across 18 unique sensing strains using a new chip design that houses 16
E. coli strains and 2 B. subtilis strains. We initially used this chip to test our ammonium
sensor, with a representative induction shown in Figure 3a and 3b. Our B. subtilis NCIB
3610/Amm3 construct is a "lights-off" sensor, meaning that GFP production drops when
ammonium is introduced, following a slight increase due to faster growth on this preferred
nitrogen source. We observe this behavior in response to 5 ppm NH4-N in Figure 3b.
The two B. subtilis "gill" regions harboring the Amm3 strain respond sensitively
to ammonium, whereas various strains in the E. coli "gill" regions shown in Figure 3a
do not. Additional toxin exposures of previously untested strains in this 18-strain chip
demonstrated our ability to sense cobalt, an additional toxin not included in our primary
list of eight. We discovered that strain E. coli MG1655/Pb7 responds sensitively to cobalt,
whereas no other E. coli or B. subtilis strain does (see Fig. 3c, d). We also learned that
strains E. coli MG1655/Zn6, E. coli MG1655/Pb7, and E. coli MG1655/Pb8 respond
sensitively to lead in descending magnitude, whereas no other strains do (see Fig. 3e, f). By
combining these sensor responses, we can identify cobalt by the unique response of Pb7
alone and lead by the unique combination of Zn6, Pb7, and Pb8 responses.
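The combinatorial identification described above can be sketched as a lookup from the set of responding strains to a toxin. This is only an illustration and not the project's trained classifier: the response threshold, the function name, and the example ratios are hypothetical, and only the two signatures stated in the text are included.

```python
# Response signatures taken from the text: cobalt induces Pb7 alone,
# while lead induces Zn6, Pb7, and Pb8 together.
SIGNATURES = {
    frozenset({"Pb7"}): "cobalt",
    frozenset({"Zn6", "Pb7", "Pb8"}): "lead",
}


def identify_toxin(induction_ratios, threshold=1.5):
    """Map the set of strains whose induction ratio meets a threshold to a toxin.

    The threshold of 1.5 is an arbitrary illustrative value.
    Returns None when the pattern matches no known signature.
    """
    responding = frozenset(
        strain for strain, ratio in induction_ratios.items() if ratio >= threshold
    )
    return SIGNATURES.get(responding)


# Made-up induction ratios for three strains on one chip.
print(identify_toxin({"Zn6": 1.0, "Pb7": 2.4, "Pb8": 1.1}))  # cobalt
print(identify_toxin({"Zn6": 3.0, "Pb7": 2.0, "Pb8": 1.6}))  # lead
```

An exact set match is deliberately strict; a trained classifier can also weigh response magnitude and timing, which a lookup like this ignores.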

To  date,  we  have  performed  247  on-chip  toxin  exposures  with  these  five  prototype 
sensors,  capturing  a  total  of  109,000  transmitted  light  and  fluorescence  images.  We  have 
run  a  single  microfluidic  chip  for  up  to  50  days,  which  includes  inducing  and  un-inducing 
the  strains  with  various  toxins  twice  each  day.  Figure  4  shows  the  response  of  a  single 
copper-sensitive  strain  periodically  exposed  to  toxins  and  imaged  over  a  period  of  34 
days,  where  each  row  contains  approximately  three  days  of  data.  Mean  fluorescence  of 
cells  within  the  trapping  region  is  plotted  over  time,  and  toxin  exposure  events  are  color- 
coded.  Exposures  to  "on-target"  toxins  are  shaded  red,  and  exposures  to  "off-target" 
toxins  are  shaded  grey.  Double-toxin  exposures  that  include  the  "on-target"  toxin  are 
shaded  orange.  Toxin-spiked  water  samples  were  drawn  through  peristaltic  pump  tubing 
to  the  cells  with  delays  ranging  from  1-9  hours,  depending  on  the  pump  speed  and  length 
of  tubing.  Current  long  delays  are  due  solely  to  slow  pump  rates  and  long  tubing  lengths 
in  our  initial  prototype  and  not  to  any  fundamental  characteristic  of  the  genetic  circuits 
or  microfluidic  devices.  As  such,  they  will  be  minimized  in  future  design  iterations.  To 
visualize  and  accurately  determine  the  moment  of  cellular  exposure  to  toxin,  on  1/27  we
began  introducing  a  20-minute  pulse  of  green  fluorescent  tracer  dye  followed  by  a  20- 
minute  pulse  of  pure  water  before  flowing  in  the  toxin.  These  green  fluorescent  pulses 
are  shaded  green. 
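The dependence of the exposure delay on pump speed and tubing length can be estimated from the tubing volume and the flow rate. The sketch below assumes ideal plug flow through a tube of uniform inner diameter. The function name and the dimensions in the example are hypothetical and are not the prototype's actual values.

```python
import math


def transit_delay_hours(tube_length_cm, inner_diameter_mm, flow_ul_per_min):
    """Time for fluid to travel the length of the tubing, assuming plug flow."""
    if flow_ul_per_min <= 0:
        raise ValueError("flow rate must be positive")
    radius_cm = inner_diameter_mm / 10.0 / 2.0
    # Tube volume in cm^3, converted to microliters (1 cm^3 = 1000 uL).
    volume_ul = math.pi * radius_cm ** 2 * tube_length_cm * 1000.0
    return volume_ul / flow_ul_per_min / 60.0


# Hypothetical example: 100 cm of 0.5 mm inner-diameter tubing at 1 uL/min.
delay = transit_delay_hours(tube_length_cm=100.0, inner_diameter_mm=0.5, flow_ul_per_min=1.0)
print(round(delay, 2))  # about 3.27 hours
```

Under this model the delay scales linearly with tubing length and inversely with flow rate, which is consistent with attributing the long delays to the prototype's plumbing and not to the genetic circuits.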

In Figure 4, all the "on-target" copper exposures are followed by a rise in mean cellular
fluorescence. This indicates that the E. coli MG1655/Cu1 strain responds sensitively
to copper. Conversely, the majority of "off-target" toxin exposures are not followed by
a rise in mean fluorescence. This indicates that the E. coli MG1655/Cu1 strain responds
quite specifically to copper. For comparison, Figure 5 shows the response of the E. coli
MG1655/Hg3 strain to various toxin exposures over the same 34-day period. Our initial
"on-target" mercury exposures through 2/16 were intended to establish our sensing
ability. We subsequently probed the detection limit of this strain by flowing in low
concentrations of mercury, which did not elicit a response. Strain E. coli MG1655/Hg3
also exhibits less specificity than strain E. coli MG1655/Cu1, as indicated by rises in
mean fluorescence following "off-target" inductions. This has not been problematic, as
we have identified a unique combination of strain responses that is specific for mercury.
Movies of the 8-strain single-chip experiment analyzed in Figures 4 and 5 can be found at
http://biodynamics.ucsd.edu/DARPA_movies/index_DARPA.html.

Figure 4: 34 days of continuous fluorescence data from the E. coli MG1655/Cu1 strain as it responds to two
inductions per day of various toxins and concentrations.

Figure 5: 34 days of continuous fluorescence data from the E. coli MG1655/Hg3 strain as it responds to two
inductions per day of various toxins and concentrations.

Figure 6: Inhibition of cellular esterase activity in S. cerevisiae during malathion exposure.

We were successful in transitioning at least one candidate toxin-sensitive promoter for
each toxin from RNA-Seq analysis to microscope experiments to prototype sensor
experiments, with the exception of our initial construct based on the single promoter candidate
for malathion (Mal1), which did not produce a discernible response upon loading into
the sensor device. This surprised us, and we continued to generate additional constructs
based on this promoter while we pressed ahead on collecting on-chip response data for
all other sensing strains using the sensor prototypes. After some investigation, we
discovered a more robust malathion detection scheme that leverages its esterase inhibition
effect in yeast. We loaded S. cerevisiae into a gill chip and added CFDA (carboxyfluorescein
diacetate) to the growth medium. Because CFDA permeates the cell membrane and
is cleaved by cellular esterases into a fluorescent product, we expected the inhibition of
esterase activity by malathion to decrease cell fluorescence. This is what we observed
during two sequential exposures to malathion in Figure 6. GFP signal significantly drops
in the presence of 0.725 g/L malathion, both at around 1,000 and 3,000 min, and
subsequently recovers. Because CFDA breaks down in the medium over time, thereby
increasing the background signal, we introduced fresh medium + CFDA at 1,500 and 2,000 min.
This explains the drift in baseline GFP during this period, which can be eliminated in
subsequent experiments by mixing dye and medium on-chip.

Our successful measurement of esterase inhibition by malathion in wildtype yeast
illustrates the power of our forthcoming whole-genome detection scheme, whereby we
will continually measure the responses of 2,000 E. coli and S. cerevisiae promoters arrayed
in a microfluidic chip. We expect the observed esterase inhibition to be significantly
represented in the genomic response of this multi-strain library during malathion exposure.

Task  A


Task Objective: Identify a combination of cellular signals that will indicate the detection
of specific targeted chemical agents

Milestone: 12 months

Metrics/Completion Criteria:

Milestone 1: Create an initial library of transcription-based sensors

Milestone 2: Massively expand the library of transcription-based sensors using Next
Generation Sequencing (NGS) techniques

Deliverable: Comprehensive list of candidate genes in E. coli that respond to target
compounds

Reporting  Updates 

Milestone 1: Create an initial library of transcription-based sensors

Summary: We have completed this milestone. We created a list of toxin-responsive
promoters based on literature research in Milestone 1.1. We assembled promoters
from multiple bacterial species along with necessary regulatory genes into synthetic
expression constructs in Milestone 1.2. We obviated the need for the intermediate
batch-culture plate reader tests of the expression constructs in Milestone 1.3 by
validating the constructs directly within custom microfluidic devices in Milestone 1.4.

Milestone 1.1: Identify 3 promoters for each toxin of interest based on literature
searches: We have identified 3 or more promoters from literature for each toxin, when
available. The exceptions are copper (2), lead (1), and malathion (0), for which fewer
than three promoters are known.

Bacteria that grow in environments with high toxin levels, occurring either naturally
or as a result of pollution, have evolved pathways to mitigate their effects
on cellular metabolism. These resistance pathways are often activated by a specific
transcription factor that is sensitive to the intracellular concentration of the
toxin. We searched the scientific literature to identify well-characterized
transcription factor/promoter pairs. Where divergent pathways responding to the same
toxin have been characterized, we chose at least one representative pathway from
each major evolutionary clade. Identified candidate regulatory mechanisms for
specific toxins, including the native organism and naturally occurring DNA construct,
are provided below.

Arsenic: The arsR family of transcription factors contains many arsenic-sensing
members, including one found on the E. coli genome [1] and another encoded on the E. coli
R773 plasmid [2]. We selected these two candidates for synthesis because they are
native to E. coli and have been used previously to construct a biosensor [3]. We also
synthesized an arsR construct from S. aureus.

Cadmium: Cadmium-responsive transcription factors have been identified from
both the arsR family [4] and the merR family [5] of transcriptional regulators. We
selected a representative member from each group: cadC from S. aureus [6] and cadR from
P. putida [7].

Chromium(VI): We selected the chromate-responsive element from O. tritici,
which has been characterized [8] and used to construct a highly specific biosensor [9], a
related system from C. metallidurans, and an unrelated chromate-responsive
transcription factor from the genome of B. subtilis NCIB 3610 [10].

Copper: We selected two well-studied, functionally unique copper-responsive
elements native to E. coli [11]. CueR functions as an activator/repressor of the merR family,
while cusS/R is a two-component sensor system including a histidine kinase, which
may aid in signal amplification.

Lead: The only lead-specific system identified in the literature is pbrR [12], found
on a megaplasmid in C. metallidurans [13].

Mercury: Mercury-inducible merR systems have previously been used to construct
biosensors with low detection limits [14], where merR functions as a repressor
in the absence of mercury and an activator in the presence of mercury. We adapt
the same strategy for our biosensor plasmids, selecting three well-studied members
of the merR family [15,16,17]. Another more evolutionarily divergent merR protein from
S. lividans was not pursued because it was shown to function only as a repressor [18].

Ammonium: Sensing ammonium requires culturing cells in a background nitrogen
source that is less preferred than ammonium. While nitrate is the most suitable
background nitrogen source for this purpose due to its long-term stability in solution,
E. coli MG1655 is unable to assimilate it under aerobic conditions. Therefore, we
decided to sense ammonium using the bacterium B. subtilis NCIB 3610, which can utilize
both nitrate and ammonium as a nitrogen source. We incorporated the native
ammonium-sensitive promoters pnasA and pnasB and the synthetic promoter pspoi-tnrA
into plasmids for integration into a neutral site on the B. subtilis NCIB 3610 genome,
where they act as a second copy of the promoter to drive GFP expression in the
presence of ammonium.

For completeness, in Table 1 we present the full list of identified
candidate constructs. Sensor plasmids containing the most promising
regulatory candidates (shown in boldface in Table 1) for arsenic,
cadmium, chromium(VI), copper, lead, and mercury were synthesized by our
cloning vendor and ported into E. coli to drive GFP expression in the
presence of the toxin. Synthesized sensing constructs for ammonium
were integrated into a modified version of


Toxin          Source Organism            Gene / Promoter
Arsenic        E. coli plasmid            arsR/ParsR
               E. coli genome             arsR/ParsR
               S. aureus plasmid          arsR/ParsR
               B. subtilis genome         arsR/ParsR
Cadmium        S. aureus plasmid          cadC/PcadC
               P. putida genome           cadR/PcadR
               S. salivarius genome       cadX/PcadX
               S. lugdunensis genome      cadX/PcadX
Chromium(VI)   C. metallidurans plasmid   chrB/PchrB
               O. tritici transposon      chrB/PchrB
               B. subtilis genome         chrS/PchrS
Copper         E. coli genome             cueR/PcopA
               E. coli genome             (cusS/R)/PcusC
Lead           C. metallidurans plasmid   pbrR/PpbrR
Mercury        E. coli plasmid            merR/PmerR
               S. aureus plasmid          merR/PmerR
               S. marcescens plasmid      merR/PmerR
               S. lividans genome         merR/PmerR
Ammonium       B. subtilis genome         PnasA
               B. subtilis genome         PnasB
               B. subtilis genome         Pspo1-tnrA

Table 1: Identified promoters from the literature.


the B. subtilis NCIB 3610 host strain, where a motility gene (hag) and
a biofilm pathway gene (epsH) were knocked out for improved growth
within microfluidic devices.

Milestone 1.2: Clone the promoters into an expression plasmid driving
the production of GFP: Completed in Q1. We have cloned all promoters
identified from the literature into a standardized plasmid expression
system.

We synthesized two standardized plasmid backbones for testing the
candidate toxin-responsive elements identified in the literature in
E. coli. Both vectors include the p15A medium-copy origin of
replication, a spectinomycin resistance cassette for selection of
positive transformants, and a promoter-less GFP insulated by flanking
terminators. One of the vectors replaces the native ribosome binding
site (RBS) in front of GFP with a version known to produce high levels
of expression in E. coli MG1655. Both synthesis of the plasmid
backbones and insertion of the sequences for candidate toxin-responsive
elements into each version were carried out by Transcriptic. The
sequence of interest was inserted such that the transcriptional
regulator remains under the control of its native promoter, with the
inducible promoter driving expression of GFP. Because all DNA
constructs were completely synthesized, we were able to codon-optimize
sequences for improved heterologous expression in E. coli MG1655.

Milestone 1.3: Validate the response of the promoters using traditional
batch experiments (Note: validation indicates a ratio of induced
fluorescence signal to uninduced of at least 3:1): Completed in Q1.
Instead of performing preliminary testing in batch, we validated the
response of the promoters directly in our microfluidic devices.

We expected the development of our microfluidic devices to lag the
construction of our toxin-sensing plasmids, thereby requiring that
initial induction experiments be performed using a fluorescent plate
reader. However, rapid development of the microfluidics allowed us to
run initial induction experiments of the sensing strains on-chip, and,
upon successfully inducing sensing strains on-chip, we saw no need to
replicate experiments in batch culture for several reasons. First,
since cellular measurements are highly dependent upon growth phase, and
batch culture cannot provide a constant growth environment, we expect
more reproducible results from our microfluidic chemostats. Second,
inducing on-chip allows us to dynamically control the inducer
concentration and thereby more efficiently scan the induction range for
each sensing strain. Finally, our deployable biosensor device will
incorporate microfluidic culturing and optical imaging; therefore,
microscope imaging on-chip is more relevant than reading fluorescence
values within the batch culture wells of a plate reader.


Milestone 1.4: Validate the response of the promoters using
microfluidic devices. Quantitatively measure GFP signal in response to
various relevant levels of toxins of interest: We have validated the
response of the promoters in microfluidic devices for all toxins of
interest.

In Q1, we developed a microfluidic device to culture and induce each
sensing strain with varying levels of toxins of interest. Termed the
"gill chip," this device is a variation on the "biopixel" device
previously developed in our lab. Structurally, this microfluidic device
consists of a polydimethylsiloxane (PDMS) elastomer block with recessed
channels that are sealed upon bonding to a glass cover slip.
Fluidically, the core functional unit of the device is a microfluidic
channel with long, narrow branches along the sides that serve to retain
cultures of cells (Fig. 7). The precise geometry of the "cell trap"
creates high-resistance, low-flow regions where cells grow in a defined
area with continuous perfusion of fresh medium. Previous studies of
synthetic gene circuits required microfluidic devices which grow cells
in a monolayer in order to perform single-cell tracking. The gill chip
uses a similar side trap geometry, but the trap width is decreased from
100 µm to 10 µm, and the trap height is increased from 1.65 µm to
50 µm. By using thin, tall traps, the optical signal is increased while
maintaining the ability to selectively grow cells in a low-flow trap
region. The overall device consists of an array of trapping units
connected in parallel, each supporting the long-term culturing of a
cell strain. The central channels of the trapping regions are fed by a
single switchable medium source.
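
As a rough illustration of why the taller traps brighten the signal, the following sketch compares the two trap geometries using the dimensions quoted above. It assumes, as a simplification not stated in this report, that the fluorescence collected per unit of imaged area scales with the depth of the packed cell layer along the optical axis; the function and key names are illustrative only.

```python
def trap_geometry_ratios(old_w, old_h, new_w, new_h):
    """Compare two trap cross-sections (all dimensions in micrometers)."""
    return {
        # Depth of the cell layer along the optical axis; assumed here to be
        # proportional to the fluorescence collected per unit of imaged area.
        "optical_depth_gain": new_h / old_h,
        # Cross-sectional area (width x height) of each trap, in square micrometers.
        "old_area_um2": old_w * old_h,
        "new_area_um2": new_w * new_h,
    }


# Dimensions from the text: 100 x 1.65 µm (monolayer trap) vs. 10 x 50 µm (gill chip).
ratios = trap_geometry_ratios(old_w=100.0, old_h=1.65, new_w=10.0, new_h=50.0)
print(ratios)
```

Under that assumption, the 50 µm traps present a cell layer roughly 30 times deeper than the 1.65 µm monolayer traps.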

Figure 7: Functional unit of the microfluidic "gill chip." Medium is
delivered via the main channel, and cells are cultured in the branched
channels.

We optimized the length of the cell-trapping channels of the gill chip
to maximize fluorescent signal while ensuring adequate medium delivery
for healthy cell growth. Medium delivery to the packed cells in the
trapping channels is limited by diffusion, and we observed that
excessively long channels slowed cell growth and GFP production. We
determined the optimal trap length by fabricating multiple versions in
parallel and measuring expressed fluorescence from a preliminary
arsenic-sensing strain. In Figure 8, we see that a 120 µm channel
length produces the largest fluorescent signal for arsenic levels
ranging from below 1x to ~10x the EPA limit. Therefore, we standardized
our trap length in subsequent experiments at 120 µm. We confirmed that
representative gram-negative (E. coli MG1655, S. Typhimurium) and
gram-positive (B. subtilis NCIB 3610) bacterial strains as well as
yeast (S. cerevisiae MFSC120) form stable cultures in the gill chip
with 120-µm-long trapping channels. Initial experiments continuously
growing E. coli MG1655 in LB medium in the gill chip demonstrated a
device lifetime of 27 days before flow stopped due to channel clogging.
This lifetime was later extended to 50 days by replacing LB medium with
M9 minimal medium, which slows the cellular growth rate.

Figure 8: Optimization of cell trap length based on GFP expression
level. 120-µm-long traps provide the greatest fluorescent signal for
all arsenic concentrations.

While waiting for Transcriptic to synthesize our initial library of
toxin-sensitive plasmids based on our literature search, we used our
microfluidic device to measure the response of two toxin-sensing
promoters in E. coli MG1655 generated in our lab for previous work.

Figure 9: Response of arsenic-sensing plasmid pRS18 (from previous
work) to step-induction with arsenic. Images before (a) and after (b)
induction with 0.1 µM arsenic. c) Measured GFP expression over time
for induction with 0.1 µM arsenic.

We first tested induction of pRS18, an arsenic-sensing plasmid, in our
microfluidic device. A recombinant strain of E. coli MG1655 containing
pRS18 was vacuum-loaded into the microfluidic device, grown to
confluence, and step-induced with sodium ortho-arsenite (Na3AsO3) in M9
minimal medium supplemented with 0.4% glucose. Images were collected
using a Nikon Ti microscope with 4x objective magnification. Step
induction was executed using our gravity-driven "Dial-A-Wave" automated
flow control system to mix two medium sources in a defined ratio. The
cells were sequentially exposed to arsenic concentrations of 0 µM,
0.1 µM, 0.5 µM, and 1 µM in 6-h windows, allowing time for the cells
to respond to each concentration increase. The data show a significant
detection of 0.1 µM arsenic, which is below the EPA limit of 0.13 µM.
Figure 9 shows the trapped cells prior to (panel a) and following
(panel b) 6 h of induction at 0.1 µM arsenic. We believe that the rise
in GFP expression prior to the induction step is due to a small leak of
inducer into the medium stream. Average fluorescence within the
trapping regions of the images as well as arsenic concentration is
plotted over time in Figure 9c.
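
The arsenic step schedule described above can be expressed as the fraction of toxin-containing medium that a two-source mixer must deliver in each window. The sketch below is only illustrative: it assumes one toxin-free source and one source at the highest concentration in the schedule (1 µM), ideal linear mixing, and hypothetical function names. This report does not describe how the "Dial-A-Wave" system is actually programmed.

```python
def mixing_fraction(target, c_low, c_high):
    """Fraction of the high-concentration source needed to reach `target`.

    Assumes ideal linear mixing: target = f * c_high + (1 - f) * c_low.
    """
    if c_high <= c_low:
        raise ValueError("c_high must exceed c_low")
    if not c_low <= target <= c_high:
        raise ValueError("target lies outside the range spanned by the two sources")
    return (target - c_low) / (c_high - c_low)


def step_schedule(concentrations, window_h, c_low, c_high):
    """Return (start_time_h, concentration, high-source fraction) for each step."""
    return [
        (i * window_h, c, mixing_fraction(c, c_low, c_high))
        for i, c in enumerate(concentrations)
    ]


# Arsenic steps from the text (µM) in 6-h windows; the source concentrations are assumed.
schedule = step_schedule([0.0, 0.1, 0.5, 1.0], window_h=6, c_low=0.0, c_high=1.0)
for start, conc, frac in schedule:
    print(f"t = {start:2d} h: {conc} µM -> {frac:.2f} of flow from the 1 µM source")
```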

Second, we tested induction of pCueCopA, a copper-sensing plasmid, in
our microfluidic device. A recombinant strain of E. coli MG1655
containing pCueCopA was vacuum-loaded into a microfluidic device, grown
to confluence, and step-induced with copper sulfate (CuSO4) in LB
medium. Images were collected using an Etaluma LumaScope microscope
with 20x objective magnification. Step induction was again performed
using our "Dial-A-Wave" automated flow control system. The cells were
sequentially exposed to copper concentrations of 0 µM, 25 µM, 52 µM,
77 µM, and 99 µM in 6-h windows. Figure 10 shows a significant
detection of 25 µM copper, which is slightly above the EPA limit of
20.5 µM.


Figure 10: Response of copper-sensing plasmid pCueCopA (from previous
work) to step-induction with copper. Measured GFP expression over time
for induction with 25 µM copper demonstrates that we can detect copper
concentrations near the EPA limit.


In Q2, we received our literature-based library of synthesized sensor
plasmids from Transcriptic and transformed them into both E. coli
MG1655 and E. coli LABEC01 (see Detailed Methods for strain details)
for on-chip testing. To increase the throughput of our microfluidic
experiments, we developed a version of the gill chip with four
independently loadable cell growth areas for culturing four sensing
strains in parallel (see Fig. 11). These growth areas are fed from the
output of an on-chip "Dial-A-Wave" system to dynamically control the
local toxin concentration.

For on-chip induction experiments, strains were loaded into the device
and grown in M9 minimal medium with 0.4% glucose and 50 µg/ml
spectinomycin (for plasmid maintenance) for 2 days in the absence of
toxin. Cells were imaged under 4x brightfield and GFP fluorescence
filters on an Olympus IX-81 microscope every 2 min for 6 h under these
non-inducing conditions to generate a fluorescence baseline. We then
introduced the toxin to induce GFP expression and imaged the cells
every 2 min for 6 h.

Figure 11: Gill chip v9-C for parallelizing the on-chip testing of
toxin-responsive promoters. Four cell trapping regions (shown in blue)
are vacuum-loaded from four downstream fluidic ports. Long, serpentine
channels upstream of the trapping regions (shown in red) serve as
fluidic "buffer zones" to prevent the cross-contamination of strains.
The three-port "Dial-A-Wave" module at the left serves to mix two
medium streams to precisely and dynamically define the toxin
concentration in the cell trapping regions.


To determine the response to toxin induction, we compared the raw
fluorescence images before and after induction. The image stacks were
analyzed using a custom ImageJ script to measure the average GFP signal
over time for each cell trap. Any linear trend in the uninduced
fluorescence data due to cell growth was subtracted from the data. In
this manner, the dynamic signal-to-noise ratio (SNR) was calculated as:
(current fluorescence - mean uninduced fluorescence) / (standard
deviation of uninduced fluorescence).
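
The SNR calculation described above can be sketched in a few lines. This is a minimal illustration rather than the custom ImageJ script used for the analysis: it assumes the per-trap mean fluorescence has already been extracted as a time series, fits the linear trend on the uninduced frames only, and uses NumPy. The function name, argument names, and the synthetic demo data are all hypothetical.

```python
import numpy as np


def dynamic_snr(t, f, t_induce):
    """Return the SNR trace for one cell trap.

    t: frame times (h); f: mean GFP fluorescence per frame;
    t_induce: time at which toxin was introduced (h).
    """
    t = np.asarray(t, dtype=float)
    f = np.asarray(f, dtype=float)
    base = t < t_induce                      # uninduced (baseline) frames
    # Fit the linear trend on the uninduced data and subtract it everywhere.
    slope, intercept = np.polyfit(t[base], f[base], 1)
    detrended = f - (slope * t + intercept)
    mu = detrended[base].mean()              # mean uninduced fluorescence
    sigma = detrended[base].std(ddof=1)      # std. dev. of uninduced fluorescence
    return (detrended - mu) / sigma


# Synthetic demo (made-up numbers): 6 h drifting baseline, then a step at t = 6 h.
rng = np.random.default_rng(0)
t = np.arange(0, 12, 2 / 60)                 # one frame every 2 min
f = 100 + 2 * t + rng.normal(0, 1, t.size)   # noisy baseline with linear drift
f[t >= 6] += 50                              # induced GFP signal
snr = dynamic_snr(t, f, t_induce=6)
print(bool(snr[-1] > 2))
```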

We used this microfluidic platform to measure the response of several
toxin-responsive promoter constructs identified from the literature and
synthesized by Transcriptic. Table 2 summarizes the results of these
experiments, listing the most sensitive promoter constructs exposed to
each toxin and the calculated SNR values after 6 h of induction. Note
that SNR values greater than ~2 indicate a significant response.

Figure 12: On-chip time-lapse induction responses of promoter
constructs identified from the literature, expressed in E. coli, and
summarized in Table 2. a) MG1655/As3, MG1655/pRS18, and MG1655/As1 show
high sensitivity to 0.13 µM arsenic. (Note that pRS18, a plasmid
generated in our lab for previous work, shares the same promoter as
Transcriptic plasmid As7.) b) MG1655/Cd1 shows high sensitivity to
0.04 µM cadmium. c) LABEC01/Cr5 shows high sensitivity to 5 µM
chromium(VI). d) MG1655/Cu1 shows high sensitivity to 25 µM copper.
e) LABEC01/Pb2 shows high sensitivity to 7 µM lead. f) MG1655/Hg3
shows high sensitivity to 0.1 µM mercury.

The on-chip time-lapse induction response of each promoter construct in
this table can be seen in Figure 12. In each figure panel, the moment
of toxin introduction is represented as a step in the red trace. While
some promoter constructs exhibit a sensitive response by highly
expressing GFP immediately following induction, others do not. Note
that the arsenic sensor plasmid As3 was used as a negative control for
experiments with non-arsenic-sensing strains, and it showed no
induction in the presence of other heavy metals.




Toxin         Gene/Promoter  Source                    RBS        Host Strain/Plasmid   Conc. Sensed (µM)  SNR after 6 h  Figure Panel
Arsenic       arsR/ParsR     E. coli plasmid           native     E. coli MG1655/As1    0.13               20             12a
Arsenic       arsR/ParsR     E. coli genome            synthetic  E. coli MG1655/As3    0.13               33             12a
Cadmium       cadC/PcadC     S. aureus plasmid         native     E. coli MG1655/Cd1    0.04               17             12b
Chromium(VI)  chrB/PchrB     O. tritici transposon     native     E. coli LABEC01/Cr5   5                  5              12c
Copper        cueR/PcopA     E. coli genome            native     E. coli MG1655/Cu1    25                 65             12d
Lead          pbrR/PpbrR     C. metallidurans plasmid  synthetic  E. coli LABEC01/Pb2   7                  18             12e
Mercury       merR/PmerR     E. coli plasmid           synthetic  E. coli MG1655/Hg3    0.1                20             12f

Table 2: Toxin-responsive promoter constructs identified from the
literature, synthesized by Transcriptic, cloned into E. coli, and
demonstrating high sensitivity in the microfluidic device, ordered by
toxin. The promoter source and RBS used in the synthetic construct are
shown alongside the toxin concentration sensed in the microfluidic
device and SNR after 6 h. Refer to Figure 12 for on-chip time-lapse
induction responses.


Strain (on-target toxin), in column order: As3 (arsenic), Cd1 (cadmium),
Cr5 (chromium(VI)), Cu1 (copper), Pb2 (lead), Hg3 (mercury)

Toxin          Conc. (µM)    As3     Cd1     Cr5     Cu1     Pb2     Hg3
Arsenic        1.3           1.00   -0.18    0.24   -0.05    0.33    0.25
Cadmium        0.44         -0.13    1.00   -0.02   -0.08    0.46   -0.26
Chromium(VI)   2            -0.22    0.62    1.00   -0.06    1.69   -0.96
Copper*        20           -0.09, -0.08, 1.00, -0.17, 0.16
Lead           1            -0.05    0.06    0.01   -0.05    1.00    0.12
Mercury        0.1          -0.18   -0.09    0.58   -0.05    1.44    1.00
Cobalt*        20            0.30, 0.67, -0.11, -0.01, 0.80
Iron           500          -0.06   -0.31    0.86    0.17    0.96   -0.38
Nickel*        1.7          -0.08, -0.08, -0.35, -0.05, 0.16
Zinc           7.6          -0.14   -0.09    0.35   -0.08    1.06    0.78

* Only five of the six values are available for this row; they are
listed in order without column assignment.

Table 3: Specificity results for toxin-sensing plasmids identified from
the literature and cloned into synthetic constructs expressed in
E. coli. Here, strains representing the most sensitive promoters from
our microfluidic induction experiments were grown in the wells of a
microplate in the presence of on- and off-target heavy metals. Each
column represents the fluorescence response of a strain to the
on-target metal (normalized to "1") and all off-target metals, where
"0" represents the unchanged response for the no-toxin control.
Although these promoters are generally specific to the on-target toxin,
some crosstalk is evident (e.g. Cr5 is sensitive to iron; Hg3 is
sensitive to chromium(VI) and zinc; Pb2 is sensitive to several
off-target toxins).

To investigate the specificity of the most sensitive promoter construct
for each toxin identified from the literature, cloned into synthetic
constructs expressed in E. coli, and measured on-chip (see Table 2), we
grew these strains in the wells of a microplate in the presence of on-
and off-target heavy metals. Although the ability to sense cobalt,
iron, nickel, and zinc is of secondary importance, we included these
heavy metals as potential off-target inducers because they may be
present in the natural water supply under test. Each column of Table 3
shows the fluorescence response of a strain to the on-target metal
(normalized to "1") and all off-target metals, where "0" represents the
unchanged response for the no-toxin control. Although these promoters
are generally specific to the on-target toxin, some crosstalk is
evident. Fortunately, in cases of significant crosstalk (e.g. the Pb2
lead-sensing strain), nonspecific promoter responses can be combined
with other promoter responses to generate specific multi-promoter
responses (see Fig. 15 and Appendix B). This combinatorial logic was
implemented in algorithms developed in Q3-Q4. Additionally, our
literature-identified promoters were supplemented with highly specific
promoters identified via RNA-Seq analysis (e.g. the ybiI
lead-responsive promoter).
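
The normalization used in Table 3 can be written as a two-point rescaling. The sketch below is one plausible reading of the description (no-toxin control mapped to 0, on-target response mapped to 1); the function name and the raw plate-reader values in the example are hypothetical and are not taken from this report.

```python
def normalized_response(f_metal, f_control, f_on_target):
    """Rescale a raw fluorescence reading for one strain.

    f_metal: reading with the metal under test
    f_control: reading for the no-toxin control (maps to 0)
    f_on_target: reading with the strain's on-target metal (maps to 1)
    """
    span = f_on_target - f_control
    if span == 0:
        raise ValueError("on-target response equals the no-toxin control")
    return (f_metal - f_control) / span


# Hypothetical raw readings (arbitrary fluorescence units) for a single strain.
control, on_target, off_target = 1000.0, 5000.0, 3000.0
print(normalized_response(off_target, control, on_target))
```

On this scale, an off-target value near 0 indicates no crosstalk, a value near 1 indicates a response as strong as the on-target response, and negative values indicate fluorescence below the no-toxin control.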

Milestone 2: Massively expand the library of transcription based
sensors using Next Generation Sequencing (NGS) techniques

Summary: We have completed this milestone. We expanded upon our
proposed work by exposing multiple strains of bacteria (in addition to
E. coli) to relevant levels of all toxins and extracted RNA in
Milestone 2.1. To ensure that the biosensor will have high selectivity
for the toxins of interest, we exposed the sensing bacteria to multiple
toxins at once and extracted RNA in Milestone 2.2. We obviated the need
for testing at multiple temperatures by ensuring tight temperature
control in our prototype enclosure in Milestone 2.3. We optimized our
microfluidic devices to achieve growth rates similar to batch culture
in Milestone 2.4. We prepared isolated RNA and generated sequence data
in Milestone 2.5. We analyzed the sequencing data for all toxins and
discovered numerous differentially expressed genes in Milestone 2.6.

Milestone 2.1: Expose target cells to toxins of interest over four
orders of magnitude of concentration and extract RNA: Completed in Q1.
We have exposed three strains of bacteria to relevant levels of all
toxins and extracted RNA.

Our toxin exposure protocol involves first growing a culture of
bacterial cells to mid-log phase (OD<0.2). Second, we dilute with fresh
medium, add the toxin of interest, and culture at 37°C for 3 h,
ensuring that the cells do not exit exponential growth phase before
harvesting (OD<0.25). Finally, we stabilize cellular RNA using Qiagen
RNAprotect reagent, centrifuge the cells, and freeze the pellet at
-80°C.

To investigate the cellular RNA response to toxin insult, we exposed
E. coli MG1655, E. coli LABEC01, and B. subtilis 168 cells to the
toxins and concentrations shown in Table 4 and extracted RNA. In
determining the toxin concentrations for exposure, we began with the
EPA limit. We generated growth curve data for E. coli MG1655 cells
exposed to each toxin at levels around the EPA limit (Fig. 13) and,
combined with our induction data, found that the concentration range of
interest between the detection limit and cell death is generally less
than 4 orders of magnitude. Surprisingly, toxin levels in this range
tend not to slow down growth rate but instead delay the onset of the
exponential growth phase, as illustrated by copper exposure in
Figure 13a. Figure 13b shows that the dependence of growth phase lag on
copper concentration is approximately linear. If cells exposed to a
toxin at the EPA limit did not exhibit this growth phase lag (see
growth rates for mercury exposure in Fig. 13c), we assumed that they
were not at risk of death and probed additional toxin concentrations
that were higher. If cells exposed to a toxin at the EPA limit
exhibited this growth phase lag (see growth rates for chromium(VI)
exposure in Fig. 13d), we assumed that they were at risk of death and
probed additional toxin concentrations that were lower. Our general
strategy was to examine the cellular RNA response at toxin levels where
the cells are responding specifically to the presence of the toxin
without triggering a universal (and nonspecific) cellular stress
response.

Toxin          P/S   Exposure Conc. (µM)
                     (strains tested: E. coli MG1655, E. coli LABEC01,
                     B. subtilis 168; "|" separates per-strain entries)
Ammonium       n/a   71.4, 714
Arsenic        P     0.25, 1 | 0.1, 1
Cadmium        P     0.4, 1.2 | 0.04, 0.4
Chromium(VI)   P     0.2, 2 | 0.2, 2
Cobalt         S     20, 200
Copper         P     0.2, 2, 20 | 0.2, 2
Iron           S     5.4, 54
Lead           P     0.1, 3 | 0.1, 3
Malathion      n/a   21.9, 219 | 21.9, 219
Mercury        P     0.01, 0.1 | 0.01, 0.1
Nickel         n/a   0.77, 7.7
Zinc           S     76, 760

Table 4: Concentrations used for single-toxin exposures of bacterial
strains to investigate the cellular RNA response for sensitivity. Most
water toxins to be sensed by our device (boldfaced) are identified by
the EPA as being of primary (P) importance. Sensing is expected to
occur in the presence of interfering substances of secondary (S)
importance (not boldfaced).

Figure 13: Batch growth rate data for E. coli MG1655 cells exposed to
a) copper, c) mercury, and d) chromium(VI). Panel b) shows the
approximately linear scaling of lag prior to exponential growth phase
with copper concentration.

Milestone 2.2: Expose target cells to multiple toxins at once and
extract RNA to determine selectivity of sensor response and use
computational algorithms to determine a highly accurate relationship
between cellular signals and sensor response: We have exposed cells to
multiple toxins at once and determined that the promoters we identified
respond as expected, even in the presence of an additional toxin.

We have performed six RNA-Seq experiments with As+Cd, As+Hg, and Cd+Hg
in E. coli MG1655 and LABEC01 to investigate the specificity of the
expression responses. Concentrations used for each multiple-toxin
exposure are shown in Table 5. For this subset of toxin combinations,
only those RNA-Seq promoter candidates responding to each individual
toxin also respond to the same toxin in combination with others (see
the rightmost three conditions in Fig. 14 and additional figures in
Appendix B). To avoid the complexity of using this RNA-Seq approach
combinatorially to investigate specificity across all toxin
combinations, we concluded that the most efficient approach is to
develop sensor strains based on RNA data extracted following
single-toxin exposures and then expose these strains to other toxins to
look for non-specific responses.

Toxin Combination    Exposure Conc. (µM)
                     E. coli MG1655    E. coli LABEC01
Arsenic/Cadmium      0.25/0.4          0.1/0.04
Arsenic/Mercury      0.25/0.1          0.1/0.01
Cadmium/Mercury      0.4/0.1           0.04/0.01

Table 5: Concentrations used for multiple-toxin exposures of E. coli
strains to investigate the specificity of the cellular RNA response.

Because we have several strains that respond to each toxin, even with
some overlap in response between certain promoters and multiple toxins,
we can use computational analysis to determine a disjoint set of genes,
and therefore a unique response signature, for each toxin of interest.
In Q3-Q4 we exposed the prototype device containing an array of sensing
strains to each toxin and used the GFP "fingerprint" (i.e. which
strains are responding at what intensities) to train a robust
computational algorithm to decode this fingerprint into the toxin and
concentration present in the sample.
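
To make the idea of decoding a GFP "fingerprint" concrete, the sketch below trains and applies a nearest-centroid classifier. This is an assumption chosen for simplicity: this section does not specify which algorithm was trained, and the strain ordering, the labels, and all response values are made up for illustration.

```python
import numpy as np


def train_centroids(fingerprints, labels):
    """Average the training fingerprints recorded for each (toxin, conc) label."""
    X = np.asarray(fingerprints, dtype=float)
    centroids = {}
    for label in sorted(set(labels)):
        mask = np.array([lab == label for lab in labels])
        centroids[label] = X[mask].mean(axis=0)
    return centroids


def decode(fingerprint, centroids):
    """Return the label whose centroid is closest (Euclidean) to the fingerprint."""
    x = np.asarray(fingerprint, dtype=float)
    return min(centroids, key=lambda lab: np.linalg.norm(x - centroids[lab]))


# Hypothetical training set: each row is the response of three sensing
# strains; each label is (toxin, concentration in µM).
training = [
    [10.0, 0.5, 0.2],
    [12.0, 0.4, 0.1],
    [0.3, 8.0, 4.0],
    [0.2, 9.0, 5.0],
    [0.1, 0.3, 0.2],
]
labels = [
    ("arsenic", 0.13),
    ("arsenic", 0.13),
    ("lead", 7.0),
    ("lead", 7.0),
    ("none", 0.0),
]
centroids = train_centroids(training, labels)
print(decode([11.0, 0.3, 0.3], centroids))
```

A nearest-centroid rule tolerates crosstalk as long as the overall pattern across strains differs between toxins, which is the property the fingerprint approach relies on.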

Milestone 2.3: Repeat above experiments at 25°C, 30°C, and 37°C:
Completed in Q1. All biosensing organisms grow optimally at 37°C. Upon
investigating the ability to maintain a constant on-chip temperature
(Task C), we have elected to design the sensor with temperature
control, as this implementation will be simple and will eliminate the
risk of any temperature-related sensitivity or specificity issues.

Milestone 2.4: Determine how growth system affects gene expression -
compare batch grown cells to microfluidic grown and ensure proper
growth and response of cells to toxins in device environment: Completed
in Q1. We have developed the microfluidic device and confirmed that the
cells in the chambers are healthy, expressing normally, and their
growth rates are comparable to those observed in batch conditions.

A simple method to indicate healthy cell growth on-chip is to compare
estimated microfluidic growth rate with calculated batch growth rate.
We grew E. coli MG1655 cells in M9 minimal medium supplemented with
0.4% glucose in the plate reader and extracted a doubling time of
0.93 h from measurements of OD600 over time. We grew cells in the
microfluidic device with the same medium and estimated the growth rate
by collecting the effluent exiting the device, serially diluting it,
and plating it on LB agar to determine the viable cell count. Using
these data, we determined an on-chip doubling time of 0.94 h, which is
in excellent agreement with the batch data. Our batch and on-chip
growth rates are therefore comparable, and from our extensive
microfluidics experience we observe with confidence that the on-chip
cells are healthy and expressing normally.
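
For reference, a doubling time can be extracted from OD600 readings with a log-linear fit. The sketch below assumes the readings are restricted to exponential phase and uses NumPy; the function name and the synthetic data (generated to double every 0.93 h) are illustrative and are not the measurements from this report.

```python
import numpy as np


def doubling_time(t_hours, od600):
    """Doubling time (h) from a log-linear fit of exponential-phase OD600."""
    t = np.asarray(t_hours, dtype=float)
    od = np.asarray(od600, dtype=float)
    # In exponential phase ln(OD) = ln(OD0) + mu * t, so the slope is the
    # specific growth rate mu (1/h).
    growth_rate, _ = np.polyfit(t, np.log(od), 1)
    return np.log(2) / growth_rate


# Synthetic exponential-phase data with a known doubling time of 0.93 h.
t = np.array([0.0, 0.5, 1.0, 1.5, 2.0])
od = 0.02 * 2 ** (t / 0.93)
print(round(doubling_time(t, od), 2))
```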

Milestone 2.5: Prepare isolated RNA for Illumina sequencing using
vendor provided reagents and carry out sequencing: Completed in Q3. We
have prepared libraries and carried out sequencing for all toxins of
interest.

After exposing bacterial cultures to various levels of toxins, we
extracted the cellular RNA and prepared it for sequencing. First, we
thawed the cell pellet, homogenized the pellet using bead beating with
RNase-free zirconium oxide, and extracted the RNA using Qiagen RNeasy
Kits. Second, we removed any contaminating genomic DNA with DNase and
performed a final purification step using a Zymo Clean and Concentrator
column. Third, we prepared the RNA library for sequencing by enriching
for mRNA using Epicentre Ribo-Zero rRNA removal kits. Fourth, we
generated cDNA and prepared an indexed Illumina library using NEBNext
Ultra Directional RNA Library Prep Kits, which retain strand-specific
orientation information. Finally, we prepared and loaded these
libraries on our Illumina MiSeq instrument for 2 x 75 bp paired-end
sequencing.

Toxin          Number of DE Genes
               Low Conc.    High Conc.
Arsenic        3            14
Cadmium        132          38
Chromium(VI)   270          377
Cobalt         609          1274
Copper         110          77
Iron           32           70
Lead           99           489
Malathion      370          41
Mercury        165          218
Nickel         43           95
Zinc           24           596

Table 6: Number of differentially expressed (DE) genes at a false
discovery rate below 1% for each toxin exposure condition of interest.


Milestone 2.6: Analyze data to determine candidate genes which are induced upon toxin
exposure: Completed in Q3. We analyzed our sequencing data and identified significant numbers
of candidate genes for each toxin of interest at false discovery rates below 1%. As we explain
in Milestone 3.3, we experimentally validated selected candidate genes for cobalt detection.

Our analysis of the sequencing data from RNA-Seq experiments to determine candidate genes that
are induced upon toxin exposure consisted of three main tasks: sequence alignment,
quantification of gene expression, and identification of differentially expressed (DE) genes.
We used and rigorously tested established software and also developed our own software to
corroborate and verify all the candidate DE genes (see Table 6). We took into account the
strandedness of the MiSeq reads to achieve greater than 95% alignment with the MG1655 genome.
Details of our quantification of the DE genes are provided in Appendix B.
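As an illustration of the final task, the sketch below selects DE genes at a false discovery
rate below 1%. It assumes the Benjamini-Hochberg procedure and a table of per-gene p-values;
the report does not state which multiple-testing correction was used, and the gene names and
p-values here are hypothetical (see Appendix B for the actual quantification).

```python
def benjamini_hochberg(pvalues, fdr=0.01):
    """Return the set of genes called DE at the given false discovery rate.

    pvalues: dict mapping gene name -> raw p-value (hypothetical input).
    Assumption: Benjamini-Hochberg step-up procedure.
    """
    ranked = sorted(pvalues.items(), key=lambda item: item[1])
    m = len(ranked)
    cutoff_rank = 0
    # Find the largest rank k whose p-value satisfies p_(k) <= fdr * k / m.
    for rank, (_, p) in enumerate(ranked, start=1):
        if p <= fdr * rank / m:
            cutoff_rank = rank
    # Every gene ranked at or above the cutoff is called DE.
    return {gene for gene, _ in ranked[:cutoff_rank]}


# Hypothetical p-values for four genes.
example = {"geneA": 0.00001, "geneB": 0.004, "geneC": 0.03, "geneD": 0.5}
de_genes = benjamini_hochberg(example, fdr=0.01)
```

With these example values, only geneA and geneB pass the 1% threshold.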

The resulting genes were cross-inspected using different methods to confirm that the most
specific genes were selected. In order to simultaneously analyze mean fold change with respect
to the negative conditions, the normalized counts, and the expression level with respect to
other genes, we implemented a toolbox that depicts a summary figure for a gene candidate as
shown in Figure 14 (see the full set of gene candidates in Appendix B). From an analysis of all
gene candidates, we set our mean fold change threshold to indicate DE to 2.5.

Figure 14: Response of the arsR promoter in E. coli MG1655 for specifically sensing arsenic.
Note that the arsR promoter responds specifically and monotonically to increasing
concentrations of arsenic alone and in combination with other heavy metals (rightmost
conditions) without exhibiting crosstalk. Plots represent the mean fold change, the normalized
counts (taking into account both the library depth and the gene length), and the "%CDF
expression level." This measure represents the position of the gene in the Cumulative
Distribution Function (CDF) obtained from the normalized counts for a given experimental
condition; that is, a large CDF indicates a high expression level for the gene compared to the
other genes in the same experiment. Replicate samples for each toxin are shown in the same
color. Control samples for each batch of experiments are shown in white.
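The three quantities summarized for each gene candidate can be sketched as follows. This is a
minimal illustration, not our toolbox: it assumes an RPKM-style normalization (reads per
kilobase per million mapped reads) as one way of accounting for library depth and gene length,
and all counts, gene names, and lengths are hypothetical.

```python
FOLD_CHANGE_THRESHOLD = 2.5  # mean fold change threshold stated in the text


def normalized_counts(raw_counts, gene_lengths_bp):
    """Scale raw read counts by library depth and gene length (RPKM-style)."""
    depth = sum(raw_counts.values())
    return {
        gene: count * 1e9 / (depth * gene_lengths_bp[gene])
        for gene, count in raw_counts.items()
    }


def percent_cdf(norm, gene):
    """Percentage of genes whose normalized count is <= that of `gene`."""
    value = norm[gene]
    at_or_below = sum(1 for v in norm.values() if v <= value)
    return 100.0 * at_or_below / len(norm)


def mean_fold_change(treated, control):
    """Ratio of mean normalized counts, treated over negative condition."""
    return (sum(treated) / len(treated)) / (sum(control) / len(control))


# Hypothetical three-gene experiment, one replicate per condition.
lengths = {"arsR": 1000, "g1": 1000, "g2": 1000}
control = normalized_counts({"arsR": 100, "g1": 400, "g2": 500}, lengths)
treated = normalized_counts({"arsR": 600, "g1": 200, "g2": 200}, lengths)

fold = mean_fold_change([treated["arsR"]], [control["arsR"]])
induced = fold >= FOLD_CHANGE_THRESHOLD
cdf_treated = percent_cdf(treated, "arsR")
```

In this toy example the candidate rises from the bottom third of the expression distribution
in the control to the top of the distribution in the treated sample, and its fold change
exceeds the 2.5 threshold.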

Our most sensitive and specific gene candidates for eight toxins identified via RNA-Seq
analysis in E. coli MG1655 are shown in Figure 15. The arsR promoter in Figure 15a responds
specifically and monotonically to increasing concentrations of arsenic alone and in combination
with other heavy metals without exhibiting crosstalk.

Figure 15: Sensitive and specific gene candidates for eight toxins identified by RNA-Seq
analysis in E. coli MG1655, where the mean fold change threshold to indicate DE has been set
to 2.5.


This gene serves as an excellent positive control for our analysis methods, as our literature
search revealed it to be the most widely studied gene for sensing arsenic in E. coli. The zntA
promoter in Figure 15b responds monotonically to increasing concentrations of cadmium alone and
in combination with other heavy metals without exhibiting crosstalk, but it also shows
sensitivity to low and high concentrations of zinc. However, since the zraP promoter is
specific for zinc, the Boolean logic expression (zntA) ∧ (¬zraP) represents a unique combined
promoter response that is specific for cadmium. The recN and sulA promoters in Figure 15c-d
respond specifically to high concentrations of chromium(VI) alone. These promoters are known to
activate during multiple double-strand DNA breaks and the cell SOS response, respectively.
Their differential expression here corroborates recent reports of chromium(VI) acting as a
carcinogen in water supplies [19, 20, 21] and may provide information on its mode of action.
The ygbA promoter in Figure 15e responds specifically to cobalt alone. The cusR promoter in
Figure 15f responds specifically to high concentrations of copper alone. This gene serves as
another excellent positive control, as our literature search showed it to be among the most
widely studied genes for sensing copper in E. coli. The ybiI promoter in Figure 15g responds
specifically and monotonically to increasing concentrations of lead alone and shows higher
specificity than the pbrR promoter identified from the literature. The nemR promoter in
Figure 15h responds specifically to high concentrations of malathion alone. The identification
of a sensitive and specific gene for detecting malathion in E. coli is entirely unexpected,
because organophosphate pesticides are neurotoxins whose mechanism of toxicity is known to
operate only in eukaryotes. Lastly, the zraP promoter in Figure 15i responds specifically and
monotonically to increasing concentrations of zinc alone. The sensitivity of zraP to zinc is
extremely high, making this promoter an excellent candidate for logical combination with
promoters exhibiting zinc interference to generate toxin-specific multi-promoter responses.

Figure 16: The nasB promoter in B. subtilis 168 was found to be sensitive and specifically
down-regulated in response to ammonium, given a mean fold change threshold of 0.5.
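The logical combination of the zntA and zraP responses can be sketched in a few lines. This is
only an illustration of the Boolean expression (zntA) AND (NOT zraP): it assumes that each
promoter is scored as "on" when its fold change reaches the 2.5 DE threshold, and the fold
change values in the examples are hypothetical.

```python
FOLD_CHANGE_THRESHOLD = 2.5  # DE threshold stated in the text


def is_on(fold_change, threshold=FOLD_CHANGE_THRESHOLD):
    """Score a promoter as responding when its fold change meets the threshold."""
    return fold_change >= threshold


def cadmium_call(znta_fold, zrap_fold):
    """(zntA) AND (NOT zraP): zntA responds to Cd and Zn, zraP only to Zn."""
    return is_on(znta_fold) and not is_on(zrap_fold)


def zinc_call(zrap_fold):
    """zraP alone is treated as the zinc-specific signal."""
    return is_on(zrap_fold)


# Hypothetical fold changes: (zntA, zraP)
cadmium_sample = cadmium_call(8.0, 1.1)   # zntA on, zraP off
zinc_sample = cadmium_call(8.0, 30.0)     # both on, attributed to zinc
clean_sample = cadmium_call(1.0, 1.0)     # neither on
```

Under this simple rule, a sample containing both cadmium and zinc would be reported as zinc
only, so a deployed classifier would likely need response magnitudes in addition to on/off
states.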

We used RNA-Seq to verify the nasB promoter in B. subtilis 168 as a sensitive and specific
promoter for the last remaining toxin, ammonium, in early Q3 (see Fig. 16). As opposed to other
response circuitry, nasB is down-regulated in response to the toxin, allowing the construction
of a "lights-off" sensor. This is desirable because, unlike the other water toxins of interest,
ammonium is expected to increase the growth rate of biosensor cells. In this situation, a
"lights-on" sensor could be triggered by any substance that increases the growth rate and
resulting GFP production ability of cells, whereas a "lights-off" sensor remains immune to
this effect.

In summary, through constructing strains based on literature searches and performing RNA-Seq
analysis, we discovered sensitive promoters for all toxins of interest. See Appendix A for a
full list of candidate toxin-responsive promoters.


Task  B 

Task Objective: Develop a preliminary microfluidic device that can culture many independent
E. coli sensor strains simultaneously

Milestone: 12 months

Metrics/Completion Criteria:

Milestone 3: Library expansion of transcription based sensors

Milestone 4: Microfluidic device development to support environmental sensing

Milestone 5: Low cost optical methods development

Deliverable: Library of E. coli clones with target promoters producing GFP; preliminary
microfluidic device capable of culturing many different clonal populations in a defined array

Reporting  Updates 

Milestone 3: Library expansion of transcription based sensors

Summary: We have completed this milestone. The promoter regions for the differentially
expressed genes identified in Milestone 2 were located using computational tools
(Milestone 3.1). We have cloned these promoters into the Milestone 1 expression systems
(Milestone 3.2). We validated the functionality of these new promoter constructs using
microfluidics (Milestone 3.3). This validation of our RNA-Seq identified promoters provides
strong evidence of the power and utility of our approach to developing novel transcription
based biosensors.

Milestone 3.1: Perform sequence analysis to determine the promoters of candidate genes
identified in Milestone 2: Completed in Q2. We have identified the promoter regions for all
candidate genes. The promoters of interest are located on the genome directly upstream of the
genes identified by RNA-Seq. For uncharacterized promoters, we used 200 bp upstream of the
gene's transcription start site to construct the sensor plasmid. Including this entire region
ensures that even cryptic regulatory sites acting on the promoter are included in the sensor
construct.

Figure 17: Validation of five cobalt-sensing promoters (Co3, Co4, Co6, Co7, Co8) identified via
RNA-Seq analysis with E. coli and one (Co2) identified from literature.
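Extracting the 200 bp region upstream of a transcription start site can be sketched as below.
This is a minimal illustration under stated assumptions: the genome is held as a plain string
with 0-based coordinates, strand is given as "+" or "-", and wrap-around at the origin of a
circular chromosome is ignored. The function names and the toy sequence are hypothetical and
are not taken from our pipeline.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def upstream_region(genome, tss, strand, length=200):
    """Return `length` bp immediately upstream of a transcription start site.

    genome: chromosome sequence as a string (0-based indexing).
    tss:    0-based index of the first transcribed base.
    strand: "+" or "-"; for "-" genes the result is reverse-complemented
            so that it reads 5' to 3' toward the gene.
    """
    if strand == "+":
        start = max(0, tss - length)
        return genome[start:tss]
    if strand == "-":
        end = min(len(genome), tss + 1 + length)
        return reverse_complement(genome[tss + 1:end])
    raise ValueError("strand must be '+' or '-'")


# Toy 16 bp "genome" with a 4 bp upstream window for readability.
toy = "ACGTACGTAAGGCCTT"
plus_promoter = upstream_region(toy, 8, "+", length=4)
minus_promoter = upstream_region(toy, 7, "-", length=4)
```

For a gene on the reverse strand, "upstream" lies at higher genome coordinates, which is why
the window is taken to the right of the start site and then reverse-complemented.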

Milestone 3.2: Clone the promoters into the expression system validated in Milestone 1:
Completed in Q2. We have cloned all identified promoters into the standardized plasmid
expression system validated in Milestone 1.

Milestone 3.3: Validate the response of the promoters using the methods developed in
Milestone 1: Completed in Q2, with updated results through Q4.

In Q2, we validated our RNA-Seq approach to identifying toxin-responsive promoters using plate
reader data from five cobalt-sensing promoters identified via RNA-Seq analysis (see Fig. 17).
We grew strains in microplate wells in the presence of various concentrations of cobalt and
measured GFP fluorescence. E. coli MG1655 strains Co1 and Co2 incorporate the nmtR promoter
from the M. tuberculosis genome as the cobalt-sensing element, whereas E. coli MG1655 strains
Co3-Co8 incorporate cobalt-responsive promoters identified through RNA-Seq. Odd-numbered
strains express GFP using the native RBS, whereas even-numbered strains use the optimized Lutz
RBS. We concluded that strains Co2-Co4 and Co6-Co8 respond significantly to the presence of
cobalt in the growth medium. These results demonstrate that our RNA-Seq analysis methodology,
supported by multiple techniques and algorithms, identifies functional toxin-responsive
promoters.

The set of validated constructs shown in Figure 18 contains one construct that has been
demonstrated sensitive to each toxin of interest within our sensor prototype, with the
exception of malathion, for which the positive results that we observed in our original
microscopy runs (shown in this plot) have not yet translated to our sensor boxes. During
investigation, we discovered a more robust malathion detection scheme that leverages its
esterase inhibition effect in yeast (see Fig. 6). Our successful testing of this method on our
microscope illustrates the power of our forthcoming whole-genome detection scheme, whereby we
will continually measure the responses of 2,000 E. coli and S. cerevisiae promoters arrayed in
a microfluidic chip. We expect the observed esterase inhibition to be significantly represented
in the genomic response of this multi-strain library during malathion exposure.

Milestone 4: Microfluidic device development to support environmental sensing

Summary: We have successfully completed this milestone. We developed a concentrated media
additive to enable robust growth of E. coli in natural water sources (Milestone 4.1). We
cultured E. coli in our microfluidic devices for up to 50 days, demonstrating long-term
reliability (Milestone 4.2). We developed and optimized a microfluidics-capable peristaltic
pump for extracting natural water (Milestone 4.3). We greatly expanded our microfluidic device,
allowing the culture of multiple strains (Milestone 4.4). We significantly increased the
fluorescence signal from our microfluidically cultured cells by optimizing the trap geometry
(Milestone 4.5). We developed techniques to prevent the release of our genetically modified
organisms (Milestone 4.6). We developed a reliable method to freeze-dry and store cells at room
temperature (Milestone 4.7). We modified our "gill" microfluidic device to allow freeze-drying
of cells on-chip (Milestone 4.8). We successfully tested our microfluidic devices using our
low-cost optical system (Milestone 4.9).

Figure 18: On-chip time-lapse induction responses of promoter constructs on our research grade
microscope. All sensor constructs are on plasmids transformed into E. coli MG1655, except for
the ammonium construct, which is integrated into the genome of B. subtilis NCIB 3610.
a) pRS18 and pZA47a show induction by arsenic. b) Cd1 shows induction by cadmium. c) Cr11 shows
induction by chromium(VI). d) Cu1 shows induction by copper. e) Pb7 shows induction by lead.
f) Hg3 shows induction by mercury. g) Mal1 initially showed induction by malathion; however, we
have not been able to acquire similar data in our sensor boxes. We have recently identified a
more robust sensing mechanism using the esterase inhibition effect of malathion in yeast.
h) Amm3 shows induction by ammonium.

Milestone 4.1: Develop a concentrated media additive that can be mixed with natural waters to
support E. coli growth: Completed in Q2, with updated results through Q3. We have developed and
tested medium formulations that successfully culture E. coli and B. subtilis in batch and
on-chip.


For all batch culture experiments, M9 minimal medium supplemented with 0.4% w/v glucose,
0.1 mM CaCl2, and 2 mM MgSO4 was used. For B. subtilis, the medium was additionally
supplemented with 0.075% v/v TWEEN 20, 50 µM FeCl3, 50 µM MnCl2, and 1 µM ZnCl2. For
B. subtilis ammonium exposure experiments, the NH4Cl in M9 minimal medium was replaced with
NaNO3, keeping the concentration of nitrogen constant.

For microfluidic experiments, we developed a minimal medium optimal for growth of bacteria and
heavy metal sensing, adapted from HMM. This medium replaces the inorganic phosphate in M9
minimal medium with glycerol-2-phosphate, MOPS (pH = 7.2), and KCl. Inorganic phosphate is
undesirable because of its metal chelation properties and its propensity to form calcium
phosphate deposits within microfluidic channels. To minimize contaminating metals, all
microfluidic experiments were carried out with media made with extra high purity salts where
available. We found that when using these pure salts robust E. coli growth required
supplementing the medium with iron, and robust B. subtilis growth required iron, zinc, and
manganese.

The final composition of our E. coli medium, following on-chip mixing with source water, was:

1. 40 mM PharmaGrade MOPS [Sigma #PHG0007-1KG] (from 1 M stock at pH 7.2)
2. 4 mM glycerol-2-phosphate [Sigma #G6501-25G]
3. 0.4% w/v dextrose (glucose) [Sigma #D9434-1KG]
4. 1 g/l (19 mM or 262 ppm NH4-N) TraceSelect NH4Cl [Sigma #09725-100G]
5. 3.7 g/l (50 mM) TraceSelect KCl [Sigma #05257-100G]
6. 0.075% v/v Tween 20 [Acros Organics #23336-0010]
7. 50 µg/mL spectinomycin (from spectinomycin dihydrochloride pentahydrate) [Sigma #S4014-5G]
8. 1 µM FeCl3 [Alfa Aesar #A16231-500G]
9. 0.01 mM CaCl2 [Macron Fine Chemicals #4160-12]
10. 0.2 mM MgSO4 [Macron Fine Chemicals #6066-04]

The final composition of our B. subtilis medium, following on-chip mixing with source water,
was the same as for E. coli with the following modifications:

1. Replace NH4Cl with 1.6 g/l NaNO3
2. Use 50 µM FeCl3 instead of 1 µM
3. Add 50 µM MnCl2 [Baker #2540-01]
4. Add 1 µM ZnCl2 [Macron #8780-04]

Note that TraceSelect formulations of reagents were used when available to minimize the
potential for heavy metal contamination of the media.
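The mass and molar concentrations quoted in the recipes can be cross-checked with a short
calculation. The sketch below uses standard molar masses (NH4Cl 53.49 g/mol, KCl 74.55 g/mol,
NaNO3 84.99 g/mol, N 14.007 g/mol); the function names are illustrative only.

```python
MOLAR_MASS_G_PER_MOL = {"NH4Cl": 53.49, "KCl": 74.55, "NaNO3": 84.99}
NITROGEN_G_PER_MOL = 14.007


def millimolar(salt, grams_per_litre):
    """Convert a mass concentration (g/l) to a molar concentration (mM)."""
    return 1000.0 * grams_per_litre / MOLAR_MASS_G_PER_MOL[salt]


def nitrogen_ppm(salt, grams_per_litre):
    """mg of nitrogen per litre, for salts with one N atom per formula unit."""
    return millimolar(salt, grams_per_litre) * NITROGEN_G_PER_MOL


nh4cl_mM = millimolar("NH4Cl", 1.0)          # E. coli medium nitrogen source
nh4cl_n_ppm = nitrogen_ppm("NH4Cl", 1.0)
kcl_mM = millimolar("KCl", 3.7)
nano3_mM = millimolar("NaNO3", 1.6)          # B. subtilis replacement
```

The results round to 19 mM and 262 ppm NH4-N for NH4Cl, 50 mM for KCl, and 19 mM for NaNO3,
which is consistent with the nitrogen concentration being held constant when nitrate replaces
ammonium.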

Milestone 4.2: Perform testing to demonstrate the long term culture stability of such cells.
Quantify growth rate of cells and lifespan of cultures: Completed in Q2, with updated results
through Q4. We have successfully grown cells in microfluidic devices with a stable growth rate
comparable to batch culture.


We have continuously grown E. coli MG1655 in the gill chip for 23 days using growth medium
concentrate mixed with natural water from Lake Miramar. In the most recent experiment, 10× M9
medium concentrate was mixed with Milli-Q water in a 1:9 ratio at a total flow rate of 1 ml/h
using a dual-channel Instech P625 peristaltic pump. Cells were observed to grow and express GFP
in the traps after 50 days in the device, proving that the sensor strain is stable over this
time period.

Milestone 4.3: Develop a metering and mixing system to combine the concentrated media stock
with natural water at a fixed ratio and mix it well before cell exposure: Completed in Q2, with
updated results through Q4. We used an Instech P625 peristaltic pump to mix concentrated media
with natural water at a fixed ratio and calibrated the pump to achieve the desired flow rate.
Since our media additive stocks are concentrated 10-20×, we dilute them with natural water
before delivery to the cells by using a peristaltic pump to drive each liquid at a defined
volumetric flow rate through silicone tubing into the chip. Our pumping scheme uses a
custom-made dual-channel tubing set with different tubing inner diameters, which results in a
constant flow ratio. In laboratory tests, we have used tubing sets with 1:20 and 1:9 flow
ratios to successfully combine and completely mix the two liquid streams on-chip using
staggered herringbone mixers to support healthy cell growth.
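The relationship between flow ratio, per-channel flow rate, and dilution can be sketched as
follows. Two assumptions are made that the report does not state: a ratio written as 1:9 is
read as one part concentrate to nine parts water, and flow per pump revolution is taken to
scale with the tubing cross-sectional area (inner diameter squared). In practice the tubing
set would still be calibrated empirically, as described above.

```python
def split_flow(total_ml_per_h, concentrate_parts, water_parts):
    """Return (concentrate flow, water flow, dilution factor of the stock)."""
    total_parts = concentrate_parts + water_parts
    concentrate = total_ml_per_h * concentrate_parts / total_parts
    water = total_ml_per_h * water_parts / total_parts
    dilution = total_parts / concentrate_parts
    return concentrate, water, dilution


def inner_diameter_ratio(concentrate_parts, water_parts):
    """Water-to-concentrate tubing ID ratio, assuming flow scales with ID squared."""
    return (water_parts / concentrate_parts) ** 0.5


# 1:9 mixing at a total flow rate of 1 ml/h, as in the Milli-Q experiment.
conc_flow, water_flow, dilution = split_flow(1.0, 1, 9)
id_ratio = inner_diameter_ratio(1, 9)
```

Under these assumptions, a 1:9 ratio at 1 ml/h delivers 0.1 ml/h of concentrate and 0.9 ml/h
of water, dilutes a 10× stock to 1×, and corresponds to a threefold difference in tubing inner
diameter.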

Milestone 4.4: Expand the current E. coli large scale culture device (biopixel device) to have
500 individually addressable chambers: We have designed a large-scale gill chip using nested
symmetrically-split parallel channels to produce balanced flow (see Fig. 19). However, our
analysis of RNA-Seq data determined that the incidence of specific toxin-sensing promoters in
E. coli is greater than expected. Therefore, we do not require hundreds of cell-trapping
regions in the microfluidic device, and we decided to pursue a cell patterning strategy whereby
cells are micropipetted into shallow but relatively large reservoirs upstream of the cell
trapping regions.

Figure 19: CAD design for a highly parallel microfluidic device capable of culturing 512 unique
toxin-sensing promoter strains in individually-addressable "gill" cell-trapping regions. Two
inlet ports at the top are combined and mixed during passage through staggered herringbone
mixers (yellow) before branching into 512 "gill" trapping regions (red) and recombining at the
outlet port. The footprint of the entire device is small enough to fit on a 1" × 3" glass
slide.

Milestone 4.5: Increase the height of the Biopixel device's chambers to increase the output
optical signal. Ensure the cell growth dynamics are unchanged: Completed in Q1. We constructed
and tested an optimized gill chip design that raises the height of the cell trapping regions to
50 µm, thereby increasing the fluorescent signal.

Notably, we have observed synthetic strains of E. coli MG1655 produce levels of GFP within the
device that are visible to the naked eye using the appropriate filters. Over the course of
dozens of experiments, we have verified that cell growth rate is unaffected by these minor
modifications to the "biopixel" cell trap design.

Milestone 4.6: Develop a UV LED system that kills the cells as they emerge from the trapping
region and enter waste collection. Perform viability test on media exiting the waste trap: We
concluded that UV LEDs are an inefficient method for sanitizing the chip effluent. Short-wave
UV LEDs consume power, emit heat, and have short lifetimes, which would unnecessarily reduce
the operational lifetime of our biosensor device. Instead, we decided to route cell waste to a
reservoir containing bleach, which has been used as an effective sanitizer for hundreds of
years.

In Q2, we tested this strategy by depositing effluent from the pump mixing experiments into a
1-l glass bottle pre-filled with 100 ml of bleach. The waste bottle was used to collect cell
waste throughout four experiments over a period of two months and filled to around 500 ml. To
test for cell viability, the waste bottle was stirred, and 20 µl of the contents was plated on
LB agar. As a positive control, 45 µl of waste was removed, bleached again, and plated using
the same method. Neither the pre-bleached waste nor the post-bleached waste showed any
bacterial growth after 1 day on LB agar without antibiotics, suggesting the pre-bleaching
method is sufficient to eliminate viable cells in the waste bottle.

Milestone 4.7: Develop a method to freeze-dry cells allowing them to be rehydrated with little
loss of viability: Completed in Q3. We have successfully developed a method for in-chip
lyophilization and revival after long-term storage.

Several cryoprotectants suitable for engineered biosensor strains were formulated from a
combination of literature-based protocols, current industrial practices, and experimentation.
The investigated cryoprotectants include:

1. 2.5% Luria-Bertani Broth (LB) (w/v) + spectinomycin
2. 2.5% LB + 0.4% glucose (w/v) + spectinomycin
3. 2.5% LB + 0.4% sucrose (w/v) + spectinomycin
4. 2.5% LB + 0.4% trehalose (w/v) + spectinomycin
5. M9 + 0.4% glucose + spectinomycin

Relative cryoprotectant efficacy was determined via plate reader revival experiments performed
24 h, 1 wk, 2 wk, 4 wk, and 8 wk after lyophilization. Cells were revived via rehydration and
resuspension in 200 µl of revival medium within microplate wells. The plates were then
immediately placed into a Tecan Infinite M200 Pro plate reader, where growth rates were
monitored over the next 48 h.

Figure 20: Cryoprotectant revival rates for arsenic-detecting E. coli plasmid/strain
pLB-As3/MG1655. The cryoprotectant is listed over each plot, with the overnight and revival
medium listed in that order in parentheses (i.e. a strain cryoprotected in LB after being grown
overnight in M9 and revived in Trace Select M9 medium is denoted by "LB (M9/Trace)").

Revival media included:

1. M9 + 0.4% glucose + spectinomycin
2. Trace Select M9 + 0.4% glucose + spectinomycin
3. Trace Select M9 + 0.4% glucose
4. HM9 (nitrate) + 0.4% glucose

and were selected to be representative of the growth medium used in the final device.

Strains protected with optimal cryoprotectants showed little difference in viability between
cryoprotectants after two months of preservation. Both E. coli and B. subtilis strains
responded similarly to lyophilization in the cryoprotectants listed above (see Fig. 20).

The best cryoprotectant was found to be LB + 0.4% sucrose and was used to test batch and
on-chip lyophilization and revival in biosensor host strains E. coli MG1655 and LABEC31 and
B. subtilis NCIB 3610. In batch, revival efficacy experiments were performed at 24 h, 1 wk,
2 wk, 4 wk, and 8 wk after lyophilization, with no observed reduction in viability. For on-chip
testing, strains were cultured overnight to stationary and sporulation phases, respectively.
Strains were then double-washed in cryoprotectant and concentrated to 50× their batch culture
concentration before being injected into an 18-strain chip through independent loading ports.
Loading ports were sealed with a fast-curing silicone elastomer (Sylgard 170, Dow Corning),
then the device was lyophilized in a commercial freeze-dryer for 17 h before being
nitrogen-flushed, desiccated, and sealed within opaque packaging.

Figure 21: Images showing the on-chip revival of lyophilized sensor strain pLB-As7 (MG1655).
Device image a) 0.5 h after chip wetting and b) after 11 hours of robust post-revival growth.
The strain was loaded, grown to confluence, lyophilized in LB + 0.4% sucrose + spectinomycin
for 17 h (9 h freeze, 8 h dry), stored for 24 h, and revived in LB + spectinomycin.

Following room-temperature storage for up to two months, strains were revived via de-gas driven
chip wetting and pump-driven flow. Upon introducing fresh medium, strains revived on time
scales equivalent to those in batch (see Fig. 21 and the accompanying movie). In addition, the
ammonium-sensing B. subtilis NCIB 3610 strain was successfully sporulated using standard
sporulation medium, which offers an additional preservation method with an extremely long
shelf life.

Milestone 4.8: Develop a deposition technique to place cells into a region of a microfluidic
device that is then bonded to a glass coverslip: Completed in Q3. We have successfully
developed a deposition strategy whereby individual biosensor strains are injected into on-chip
reservoirs, where they are lyophilized, stored, and revived.

To independently culture multiple biosensor strains, an 18-strain chip was designed, built, and
successfully tested for multi-strain loading and freeze-drying. The chip dimensions conform to
the requirements of both the strains and the biosensor's optical detection system.

Milestone 4.9: Develop and test the microfluidic device in a laboratory environment: Completed
in Q2. We have conducted extensive testing of our microfluidic devices in a laboratory
environment, using both research grade and low cost optical systems.

In Q2, the microfluidic "gill" chip described in Milestone 1.4 was used to test the optical
system developed by the Ziva Corporation. We compared the optical system to that of our
research grade microscope, an Olympus IX81. The Ziva optical system was designed with lower
resolution optics compared to those of the 4x objective on the IX81 in order to lower cost
while increasing the imaged field of view. Although the Ziva optics are lower resolution, the
produced images compare favorably with the IX81 as shown in Figure 22.

Milestone 5: Low cost optical methods development

Summary: We have successfully completed this milestone. We have developed a low cost optical
system by partnering with the Ziva Corporation, thereby completing sub-milestones 5.1, 5.2,
and 5.3. We have also determined that bioluminescent systems can be significantly more
sensitive than fluorescence based systems, thus completing Milestone 5.4.

Figure 22: A comparison of optical systems. The same microfluidic device was imaged with a) low
cost biosensor optics developed by the Ziva Corporation and b) a research grade Olympus IX81
microscope. While the Ziva image may appear inferior, this is an artifact due to zooming of the
image for comparison (the Ziva system is designed with a wide field of view compared to the
Olympus). We have verified that the Ziva system yields images that equal the quality of the
Olympus system, which is remarkable given the difference in cost ($2K for Ziva versus $125K for
Olympus).

Milestone 5.1: Design and construct an LED array for the GFP excitation of the cells in the
microfluidic device: Completed in Q2. The excitation LED system has been designed and
delivered.

In Q2, the optical setup was designed in partnership with the Ziva Corporation. We chose a CREE
XTE Indus Star 1 Royal Blue High Power LED (manufacturer part #CREEXTE-ROY-1) with an emission
wavelength maximum of 465 nm, which is well-suited for GFP excitation. The LED is paired with a
1400 mA BuckBlock constant current LED driver (manufacturer part #0A009-D-V-1400) and an LED
light housing with a 15 W heat sink (manufacturer part #ALK-LH-15W).

Milestone 5.2: Design and construct optical filters for GFP excitation and emission
which cover the entire area of the microfluidic device: Completed in Q2. The optical
filters for GFP excitation and emission have been designed and delivered.

The optical setup was designed in partnership with the Ziva Corporation. The necessary
filters were purchased from Semrock, Inc. The part numbers are FF495-Di03-25x36,
FF01-520/35-23.3, and FF01-457/50-25 for the dichroic, emission, and excitation
filters, respectively.

Milestone  5.3:  Obtain  and  characterize  the 
performance  of  a  low  cost  camera  system  to  im¬ 
age  the  fluorescence  signal  of  the  device:  Com¬ 
pleted  in  Q3.  We  have  built  an  imag¬ 
ing  system  using  the  "Chameleon"  camera 
(part  #CMLN-12S2M-CS)  from  Point  Grey 
Research,  Inc.  with  the  design  assistance 
of  the  Ziva  Corporation.  This  is  a  1.3 
megapixel  monochrome  camera  featuring  a 
Sony  ICX445  CCD  imager.  It  contains  a  12- 
bit  analog-to-digital  converter  with  a  max¬ 
imum  gain  of  24  dB.  The  camera  package 
includes  a  software  development  kit  (SDK), 
known  as  FlyCapture,  which  is  compati¬ 
ble  with  the  PandaBoard  single  board  com¬ 
puter  system  that  we  have  chosen  for  our 
electronics  platform. 

The microfluidic "gill" chip developed in Milestone 1.4 was used to compare the optical
system developed by the Ziva Corporation with our research grade microscope, an
Olympus IX81. The Ziva optical system was designed with lower resolution optics
compared to those of the 4x objective on the Olympus in order to lower cost while
increasing the image field of view by 20× for imaging multiple "gill" trapping regions.
Images acquired with the Ziva optics compare favorably with those acquired with the
Olympus, as shown in Figure 22. Notably, the Ziva optics are approximately 2× more
sensitive at detecting GFP than the Olympus optics (SNRs are 23.8 and 11.6,
respectively). Because our primary objective is to detect weak signals, the Ziva system
outperforms the Olympus system at more than 50× lower cost.

Figure 23: A side-by-side comparison of reporter systems on otherwise identical plasmid
backbones demonstrates the superior detection limits achieved by using luminescence.


Milestone  5.4:  Replace  the  GFP  fluorescence 
system  with  a  luminescent  system  based  on 
the  lux  operon  of  A.  fischeri:  Completed 
in  Q2.  We  have  replaced  the  GFP  fluores¬ 
cence  system  with  a  Lux  system  for  our 
best-performing  arsenic  sensor  plasmid. 

The  arsenic  sensor  construct  As7  was 
modified  to  replace  the  gfp  gene  with  the 
luxCDABE  operon,  and  relative  induction 
(fluorescence  or  luminescence)  of  the  two 
constructs  was  tested  side-by-side  using  a 
Tecan  Infinite  M200  Pro  plate  reader  (see 
Fig.  23).  The  background  noise  was  signif¬ 
icantly  lower  for  the  Lux  construct,  allow¬ 
ing  the  detection  of  arsenic  at  a  concentra¬ 
tion  of  0.2  nM.  In  contrast,  the  GFP  con¬ 
struct  was  sensitive  to  arsenic  only  above  a 
concentration  of  26  nM.  For  comparison,  we 
were  able  to  detect  an  arsenic  concentration 
of  130  nM  using  the  GFP  construct  in  a  mi¬ 
crofluidic  device.  Thus,  replacing  GFP  with 
Lux  promises  to  greatly  increase  the  sensi¬ 
tivity  of  the  biosensor. 


Task  C 

Task  Objective:  Develop  a  prototype  of  a 
deployable  device  for  cheap  and  continu¬ 
ous  monitoring  of  water  contamination  by 
specific  target  compounds 




Milestone:  12  months 

Metrics/Completion  Criteria: 

Milestone  6:  Device  prototype  develop¬ 
ment 

Milestone  7:  Build  computational  models 
to  determine  the  threshold  of  detection  for 
specific  sensors  based  on  models  of  experi¬ 
mental  GFP  responses 

Milestone  8:  Use  the  models  obtained  in 
Milestone  7  to  investigate  whether  the  com¬ 
bination  of  nonspecific  sensor  responses  to 
some  toxins  can  be  utilized  to  improve 
specificity 

Milestone  9:  Develop  the  controller  board 
to  carry  out  sophisticated  analysis  of  com¬ 
plex  data 

Deliverable:  Finalized  device  prototype 
and  computational  algorithm  for  continu¬ 
ous  water  quality  monitoring 

Reporting  Updates 

Milestone  6:  Device  prototype  develop¬ 
ment 

Summary: We have successfully completed this milestone. We obtained and characterized
a low power peristaltic pump (Milestone 6.1). We have obtained a filter to prevent
device clogging (Milestone 6.2). We have demonstrated that flow reversal to prevent
clogging is not necessary based on our filtration design (Milestone 6.3). We have
obtained an environmental enclosure and heater (Milestones 6.4 and 6.5). We have
obtained and developed the software components necessary for device control and data
transmission (Milestone 6.6). We have developed a positive control stock for sensor
testing purposes (Milestone 6.7). We have developed a solar cell based charging system
(Milestone 6.8). We have assembled a fully functional prototype and have tested it in
an outdoor environment (Milestone 6.9).


Milestone 6.1: Find and characterize a low power pumping system that can pump natural
water through the microfluidic device at a rate of 1 ml/h: Completed in Q1. We have
purchased and tested a low power peristaltic pump compatible with microfluidics.

Milestone 6.2: Develop a filter system to prevent clogging of the microfluidic device:
Completed in Q1. We have successfully implemented filtering to prevent clogging when
pumping from natural water sources.

Milestone 6.3: Develop a flow reversal pumping regime to help clear the filter of
contaminants. Test and revise the de-clogging method to ensure a runtime of at least
one month per sensor: Completed in Q1. Due to the success of our filter system
developed in Milestone 6.2, we determined that flow reversal was not necessary.

Milestone 6.4: Develop a waterproof enclosure that can be used to house the device and
the electronics: Completed in Q1. We have purchased and tested an all-weather
enclosure suitable for outdoor use.

Milestone 6.5: Develop a heating system to ensure the microfluidic device maintains the
appropriate growth temperature: Completed in Q1. We have tested a heating system for
our device.

Milestone 6.6: Develop the electronics/software to coordinate the pumping regimes,
image capture, data transmission and device power management: Completed in Q2. We
have obtained the necessary hardware and software for controlling the electronic
components of our biosensor.

We assembled components to support the time-lapse fluorescence imaging and analysis of
cells within microfluidic devices. We also implemented a low cost, low power,
all-solid-state PandaBoard single-board computer to control image acquisition and
peripherals. We implemented a fluorescence imaging assembly designed by the Ziva
Corporation (Fig. 25d) to mount in our prototype imaging scaffold. Light from a blue
excitation LED reflects off a dichroic mirror and illuminates the sample. The optics
collect fluorescent light emitted by the sample and pass it through the dichroic mirror
and emission filter while focusing it to an image at the CCD camera. We developed a
custom software package to control data acquisition, regulate pump speed, and maintain
appropriate environmental conditions for the biosensor.

Milestone 6.7: Develop a chemical control stock to test the proper operation of the
biosensor (i.e. add low doses of toxins for positive control): Completed in Q1. We
developed concentrated chemical stocks of the eight toxins that are easily mixed to
form a solution containing low doses of toxins to serve as a positive control in
testing proper biosensor operation.

Milestone 6.8: Develop a solar powered version of the prototype (including battery
panels and charge controller): Completed in Q2. We have developed an independent solar
charging station that can easily interface with our biosensor device.

To ensure compatibility with solar or battery power, we selected all components of our
device (including the heater) to be DC powered. We then located and purchased an
isolated DC/DC converter (Mean Well part #SD-50A-12) that is capable of generating a
regulated 12 VDC output from an unregulated 9.2-18 VDC input. The regulated 12 VDC
supply is used to power our pump, heater, and electronics. The unregulated 9.2-18 VDC
input range covers the voltage range of a standard lead acid battery; therefore, we
purchased a deep cycle, lead acid battery (Interstate battery part #DCM0035) with
sufficient energy storage (35 Ah) to power our device for approximately two days. To
charge the battery, we purchased a 100 W monocrystalline solar panel and a 30 A pulse
width modulation (PWM) charge controller from Renogy. The solar panel components were
assembled and mounted onto a galvanized steel pole at our field test facility.
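
As a rough sanity check on the two-day runtime, the battery capacity can be converted into an average power budget. The sketch below is an estimate under stated assumptions: only the 35 Ah capacity and the two-day runtime come from the text, while the 12 V nominal battery voltage, the usable capacity fraction, and the converter efficiency are illustrative values that the report does not specify.

```python
def average_power_budget_w(capacity_ah, nominal_v, runtime_h,
                           usable_fraction=1.0, converter_efficiency=1.0):
    """Average load power (W) a battery could sustain for runtime_h hours."""
    if runtime_h <= 0:
        raise ValueError("runtime_h must be positive")
    stored_wh = capacity_ah * nominal_v  # nominal stored energy in watt-hours
    delivered_wh = stored_wh * usable_fraction * converter_efficiency
    return delivered_wh / runtime_h


# From the report: 35 Ah battery, approximately two days (48 h) of operation.
# Assumed: 12 V nominal battery voltage.
ideal_w = average_power_budget_w(35, 12.0, 48)

# Hypothetical derating: 80% usable capacity and 85% DC/DC converter efficiency.
derated_w = average_power_budget_w(35, 12.0, 48,
                                   usable_fraction=0.8,
                                   converter_efficiency=0.85)

print(f"ideal budget:   {ideal_w:.2f} W")
print(f"derated budget: {derated_w:.2f} W")
```

Under these assumptions the device, including the heater, would need to average somewhere in the range of 6 to 9 W for the stated runtime to hold. The actual load was not reported, so this bounds the budget and does not confirm the measurement.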

Milestone 6.9: Develop a completed device prototype and test in an outdoor
environment: Completed in Q4. We have assembled the individual components listed in
sub-milestones 6.1-6.8 into a functional prototype that is capable of acquiring and
processing data. Images of this prototype are shown in Figure 25.

We have finalized the design of our 18-strain microfluidic device that can collect
toxin response data from 18 different toxin-specific strains simultaneously (Fig. 24).
This is housed inside our sensor prototype and imaged using our custom optics. The
prototype is contained in a 16" x 14" fiberglass enclosure (Fig. 25a) with an aluminum
front panel designed to protect the interior components from the environment. A
proportional-integral-derivative (PID) temperature controller is visible in the upper
left hand corner of the front panel with the peristaltic water pump in the lower right
corner. The interior of the prototype is shown in Fig. 25b with the electronics
sub-enclosure (Fig. 25b-1) and Ziva optical system clearly visible (Fig. 25b-5). The
use of a PID controller rather than a simple thermostat is necessary for precise
control of the enclosure's internal temperature to within 0.1°C of the set point.
Individual components are numbered 1-8. Briefly, the electronics enclosure
(Fig. 25b-1) contains the hardware for controlling the Ziva optics, processing images,
and transmitting data. This system can also communicate with the temperature
controller (Fig. 25b-2) using the Modbus protocol to adjust the interior temperature of
the enclosure (normally kept at 37°C).

Figure 24: CAD drawing of the 18-strain microfluidic chip used to collect toxin response data from 18 different
toxin-specific strains simultaneously. Left) View of the entire microfluidic device. The resistance
to each cell trapping region is identical, ensuring equivalent flow to all areas of the device. Black channels:
E. coli analyte distribution channels. Blue channels: B. subtilis analyte distribution channels. Red
rectangle: magnified area shown in the right panel. Right) The field of view captured by our imaging system.
Red channels: B. subtilis "gill" trapping regions. Green channels: E. coli "gill" trapping regions. The
collections of rectangles flanking the lower E. coli trapping region are present to assist in automatic image
registration.


This version of the prototype is designed to be powered from a 120 VAC source, and the
AC power distribution block, along with a supplemental protection circuit breaker and
solid state relay for controlling the heater, is shown in Fig. 25b-3. A tri-voltage
power supply, outputting 5, 12, and 24 VDC for the electronics and pumps, is shown in
Fig. 25b-4. The Ziva optical system for acquiring image data is shown in Fig. 25b-5.
The heater is combined with a circulating fan to distribute warm air throughout the
enclosure (Fig. 25b-6). Briefly, the user inputs the desired temperature through a
custom software package designed by our group that runs on the PandaBoard system
(Fig. 25b-1). The PandaBoard communicates with the PID controller (Fig. 25b-2) to set
the desired temperature using the Modbus protocol. The PID controller modulates the
heater's activity based on the set point temperature and the current temperature of
the enclosure. To regulate the heater's output, the PID controller generates a pulse
wave signal that drives the activity of a solid state relay (Fig. 25b-3), which turns
the heater's AC power source on and off (note: the fan remains constantly on to
circulate air). Figure 25b-7 shows the DC power distribution system for the
electronics and peristaltic water pump (Fig. 25b-8).
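
The temperature loop described above (a PID controller whose output is a pulse signal that switches a solid state relay) can be illustrated with a textbook time-proportioning scheme. This is a minimal sketch and not the algorithm of the commercial controller used in the prototype, which the report does not describe. The gains, the 10 s relay window, and the first-order thermal model (heater power, heat loss, heat capacity) are all hypothetical values chosen only so that the simulation settles near the 37°C set point.

```python
class PID:
    """Textbook discrete PID controller with output clamped to a duty cycle in [0, 1]."""

    def __init__(self, kp, ki, kd, setpoint):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.setpoint = setpoint
        self.integral = 0.0
        self.prev_error = None

    def update(self, measurement, dt):
        error = self.setpoint - measurement
        derivative = 0.0 if self.prev_error is None else (error - self.prev_error) / dt
        self.prev_error = error
        candidate = self.integral + error * dt
        raw = self.kp * error + self.ki * candidate + self.kd * derivative
        duty = min(1.0, max(0.0, raw))
        # Simple anti-windup: keep the integral frozen while the output is saturated.
        if raw == duty:
            self.integral = candidate
        return duty


def relay_on_time(duty, window_s):
    """Seconds per window that the solid state relay is closed (time proportioning)."""
    return duty * window_s


def simulate(hours=4.0, window_s=10.0, ambient_c=20.0, setpoint_c=37.0):
    """Simulate the enclosure temperature with a hypothetical first-order thermal model."""
    heater_w = 100.0                 # hypothetical heater power
    loss_w_per_c = 3.0               # hypothetical heat loss to ambient
    heat_capacity_j_per_c = 5000.0   # hypothetical enclosure heat capacity
    pid = PID(kp=0.5, ki=0.005, kd=0.0, setpoint=setpoint_c)
    temp_c = ambient_c
    for _ in range(int(hours * 3600 / window_s)):
        duty = pid.update(temp_c, window_s)
        energy_in_j = heater_w * relay_on_time(duty, window_s)
        energy_out_j = loss_w_per_c * (temp_c - ambient_c) * window_s
        temp_c += (energy_in_j - energy_out_j) / heat_capacity_j_per_c
    return temp_c


final_temp_c = simulate()
print(f"temperature after 4 h: {final_temp_c:.2f} C")
```

The integral term is what lets the loop hold the set point without a steady-state offset, which a simple on/off thermostat cannot do; this is consistent with the report's stated reason for choosing a PID controller.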

To mount the necessary electronics for acquiring and processing data and to protect
them from water exposure, we designed a custom sub-enclosure using SolidWorks
(Dassault Systems) and contracted its fabrication using additive manufacturing (3-D
printing) by a local machine shop (Fig. 25c). The SolidWorks representation of this
enclosure in Figure 25c shows an Arduino Uno (Fig. 25c-1), BuckBlock LED drivers
(Fig. 25c-2), the PandaBoard system on a chip (Fig. 25c-3), and LED control relays
(Fig. 25c-4). The PandaBoard communicates with the Arduino over an RS-232 (serial)
link to modulate the LED control relays. The Arduino then generates a pulse width
modulated output signal that is interpreted by the BuckBlock LED drivers to control
the brightness of the LEDs (one each for transmitted light and GFP excitation). The
PandaBoard is then responsible for acquiring an image from the Ziva optical system
(Fig. 25b-5) and finally signalling for the LEDs to be turned off. The acquired data is
analyzed on the PandaBoard, and the results are transmitted via a secure Wi-Fi link to
our data repository server. The software to accomplish this was custom programmed in
Java and C using the Point Grey FlyCapture SDK and implemented on a PandaBoard ES rev
B.3 running Ubuntu Server. The various components of the Ziva optical system are shown
in Figure 25d, including the transmitted light optics (Fig. 25d-1), the microfluidic
device stage and thermistor temperature probe (Fig. 25d-2), the focus adjustment
system (Fig. 25d-3), the dichroic mirror holder (Fig. 25d-4), the GFP excitation
system (Fig. 25d-5), and the Point Grey Chameleon monochrome camera (Fig. 25d-6). An
image of a microfluidic device illuminated with the Ziva GFP excitation optics is
shown in Figure 25e. We built five replicates of this completed prototype to run in our
laboratory to collect data for the classifier. As proof of principle, we have also
demonstrated that the biosensor can run in an outdoor environment and draw a sample of
raw water from a public water source (Fig. 25f).

Figure 25: Biosensor self-contained prototype: a) Device enclosure, note temperature controller (upper left)
and peristaltic pump (lower right) mounted to the aluminum front panel. b) Enclosure with front panel
opened to expose internal components. b1: electronics sub-enclosure, b2: temperature controller, b3: AC
power distribution devices, b4: tri-output DC power supply, b5: Ziva optical assembly, b6: fan/heater,
b7: DC power distribution block, b8: peristaltic pump. c) SolidWorks detail of the electronics
sub-enclosure. c1: Arduino Uno, c2: BuckBlock LED drivers, c3: PandaBoard, c4: LED control relays. d)
Ziva optical assembly. d1: transmitted light optics, d2: stage/temperature probe, d3: focal adjustment
knob, d4: dichroic mirror holder, d5: GFP excitation optics, d6: monochrome camera. e) Microfluidic
device being illuminated with the GFP excitation LED. f) Image of the prototype running in an outdoor
environment with solar power.

Milestone  7:  Build  computational  models 
to  determine  the  threshold  of  detection  for 
specific  sensors  based  on  models  of  exper¬ 
imental  GFP  responses 

Summary:  We  have  successfully  com¬ 
pleted  this  milestone.  We  used  machine 
learning  techniques  to  determine  the  re¬ 
lationships  between  the  GFP  output  sig¬ 
nal  and  the  presence  of  a  toxin  (Milestone 
7.1).  We  have  created  a  database  contain¬ 
ing  the  collected  sensor  response  data  and 
have  incorporated  all  of  the  collected  data 
(Milestone  7.2).  We  have  quantified  the 
GFP  threshold  of  detection  for  each  sen¬ 
sor  construct  (Milestone  7.3).  We  have  con¬ 
structed  Receiver  Operating  Characteristic 
(ROC)  curves  for  each  sensor  to  achieve  ro¬ 
bust  sensing  (Milestone  7.4). 

Milestone 7.1: Characterization of the family of GFP sensor responses to the set of
chemicals of interest via computational models: Completed in Q3. We constructed
machine learning models capable of inferring the relationships between the GFP sensor
responses and the presence or absence of a toxin at a given concentration. The
algorithm learns these relationships from a set of training samples (GFP sensor
responses) defined by the set of experimental conditions from which they were
generated. The aim of the algorithm is to provide a general method capable of
determining the experimental conditions associated with GFP sensor responses through
the use of historical data.

Toxin               Conc. (μM)    Accuracy (%)
Arsenic (pRS18)     0.2           97.35
                    0.55          100
                    1             100
Arsenic (pZA47a)    0.1           97.35
                    0.5           100
                    1             100
Cadmium             0.022         65.00
                    0.044         54.16
                    0.44          90.00
Chromium (VI)       1.25          96.67
                    2.5           97.50
                    5             94.17
Copper              5             95.00
                    10            96.67
                    20            95.00
Lead                1.8           95.83
                    3.6           95.00
                    7.2           95.00
Mercury             0.2           63.33
                    1             87.50
                    2             99.17
Ammonium            1 ppm         71.05
                    5 ppm         91.79

Table 7: Classification accuracy results obtained from the GFP sensor responses for
multiple toxins at different concentrations.

Specifically, we have built classification models based on Support Vector Machines
(SVMs), which are among the most popular classifiers due to their excellent
performance in many contexts and their solid mathematical basis [22]. For each toxin
and concentration, we solved a binary classification problem in which the positive
class represents the presence of the toxin in water and the negative class is
associated with clean environments. Patterns were constructed with features containing
GFP sensor responses at various timestamps to capture the temporal dynamics of the GFP
signal. The optimal meta-parameters of the SVM classifier were determined by applying
5-fold cross-validation during the training phase [23].

In order to have a reliable estimate of the performance of the model when deployed in
real environments, we measured its performance over a set of samples (test patterns)
not seen during the training phase. The SVM's performance was determined by the
percentage of test samples correctly labeled as toxin/no toxin (classification
accuracy). 80% of the samples were used for training the SVM models and the remaining
20% were used to evaluate their effectiveness. We generated 20 random training/test
partitions based on data collected in single-strain chips (no crosstalk data) to
estimate performance. Table 7 shows the average classification accuracy over the test
set obtained across the 20 random partitions for each binary classification problem.
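
The evaluation protocol described above (20 random 80/20 training/test partitions, with accuracy averaged over the test sets) can be sketched as follows. To keep the example self-contained, a nearest-centroid rule stands in for the SVM and the inner 5-fold cross-validation of meta-parameters is omitted; both are simplifications and not the authors' method. The synthetic GFP traces, noise level, and function names are hypothetical.

```python
import random
from statistics import mean


def make_synthetic_patterns(n_per_class, n_timestamps, rng):
    """Hypothetical data: toxin traces (label 1) ramp up over time, clean traces stay flat."""
    patterns = []
    for label in (0, 1):
        for _ in range(n_per_class):
            trace = [1.0 + label * 0.5 * t / (n_timestamps - 1) + rng.gauss(0, 0.1)
                     for t in range(n_timestamps)]
            patterns.append((trace, label))
    return patterns


def fit_centroids(train):
    """Stand-in classifier (not an SVM): one mean feature vector per class."""
    centroids = {}
    for label in {y for _, y in train}:
        rows = [x for x, y in train if y == label]
        centroids[label] = [mean(col) for col in zip(*rows)]
    return centroids


def predict(centroids, x):
    """Assign x to the class whose centroid is closest in squared Euclidean distance."""
    def sq_dist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda label: sq_dist(centroids[label]))


def repeated_holdout_accuracy(patterns, n_partitions=20, train_fraction=0.8, seed=0):
    """Average test accuracy (%) over random train/test partitions."""
    rng = random.Random(seed)
    accuracies = []
    for _ in range(n_partitions):
        shuffled = patterns[:]
        rng.shuffle(shuffled)
        cut = int(train_fraction * len(shuffled))
        train, test = shuffled[:cut], shuffled[cut:]
        model = fit_centroids(train)
        correct = sum(predict(model, x) == y for x, y in test)
        accuracies.append(100.0 * correct / len(test))
    return mean(accuracies), accuracies


data = make_synthetic_patterns(n_per_class=60, n_timestamps=10, rng=random.Random(42))
avg_accuracy, per_partition = repeated_holdout_accuracy(data)
print(f"mean test accuracy over {len(per_partition)} partitions: {avg_accuracy:.2f}%")
```

Averaging over many random partitions reduces the dependence of the reported accuracy on any single lucky or unlucky split, which matters when the number of experiments per condition is small.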

Milestone 7.2: Construction of a database of sensors' responses to chemicals of
interest and null chemicals to be able to establish the statistical significance in
detection: Completed in Q4. We have constructed a database that is stored using MySQL
in a Thecus NAS system capable of storing 22 TB of data. All of the image sequences
gathered from the five device prototypes have been organized into a directory
structure for each device and experiment. A master file containing the time stamp of
each image and its file location has been compiled for quick image access via a
Network File System (NFS). After being processed for rotation/translation and feature
extraction, these images are the input variables to the machine learning algorithms.
The training data is currently stored using the following columns of information:

1. Date time

2. Date time in milliseconds since Unix epoch

3. Experiment ID

4. Device number

5. Toxin concentration at the start point

6. Concentration units at the start point

7. Toxin at the start point

8. Toxin at the start point using machine learning label (see below)

9. Multiple toxin flag at the start point (are there multiple toxins? Y/N)

10. Toxin concentration at the end point

11. Concentration units at the end point

12. Toxin at the end point

13. Toxin at the end point using machine learning label

14. Multiple toxin flag at the end point

15. Responding strains

16. Additional notes

17. Delay time (time before the toxin(s) reach(es) the cells)

18. Success flag (did experiment complete successfully? Y/N)

Toxin           Actual Conc.   Estimated Conc.   Std. Dev. of   T-Test
                (μM)           (μM)              Est. Conc.     Result
Arsenic         0.000000       0.094873          0.054253       FAIL
                0.100000       0.166705          0.039146       PASS
                0.200000       0.121520          0.045036       PASS
                0.500000       0.269255          0.042667       PASS
                1.000000       0.290522          0.029472       PASS
Cadmium         0.000000       0.073605          0.039364       FAIL
                0.025000       0.084992          0.035848       PASS
                0.050000       0.061783          0.019154       FAIL
                0.100000       0.081460          0.028670       PASS
                0.440000       0.258721          0.054637       PASS
Chromium(VI)    0.000000       0.402180          0.211879       FAIL
                0.200000       0.458967          0.162400       PASS
                0.500000       0.558889          0.163777       PASS
                1.000000       0.816885          0.133032       PASS
                5.000000       1.215800          0.324680       PASS
Copper          0.000000       1.037426          0.628584       FAIL
                0.100000       0.138169          0.120940       FAIL
                0.500000       0.890369          0.184934       FAIL
                1.000000       1.428817          0.437622       PASS
                2.000000       2.332876          0.320600       PASS
                5.000000       1.875258          0.285088       PASS
                10.000000      2.931468          0.816670       PASS
Lead            0.000000       0.674518          0.526677       FAIL
                0.600000       0.917133          0.294116       PASS
                1.800000       1.636931          0.326718       PASS
                3.600000       2.284836          0.987410       PASS
                7.200000       3.417676          1.085822       PASS
Mercury         0.000000       0.247743          0.185409       FAIL
                0.100000       0.241039          0.102734       FAIL
                0.200000       0.245659          0.087578       FAIL
                0.500000       0.392381          0.067889       PASS
                2.000000       0.968285          0.084330       PASS
                6.000000       1.548434          0.286386       PASS

Table 8: Toxin detection results for each toxin at various concentrations using a
two-sample T-test at a 0.05 significance level. For each toxin, the columns give the
actual concentration, the average estimated concentration, the standard deviation of
the estimate, and the result of the test.

The machine learning toxin labels are as follows: MilliQ_ddH2O = Code 1; Arsenic,
As = 2; Cadmium, Cd = 3; Cobalt, Co = 4; Chromium(VI), Cr = 5; Copper, Cu = 6;
Mercury, Hg = 7; Malathion, Mal = 8; Lead, Pb = 9; gfp Tracer = 10; and Ammonium,
NH4-N = 11.
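
The label scheme above amounts to a small lookup table. The sketch below shows one way such a mapping could be encoded for use in training scripts; the dictionary and function names are hypothetical and are not taken from the authors' software.

```python
# Machine learning label codes as listed in the report: code -> (name, abbreviation).
TOXIN_LABELS = {
    1: ("MilliQ_ddH2O", None),   # clean water control
    2: ("Arsenic", "As"),
    3: ("Cadmium", "Cd"),
    4: ("Cobalt", "Co"),
    5: ("Chromium(VI)", "Cr"),
    6: ("Copper", "Cu"),
    7: ("Mercury", "Hg"),
    8: ("Malathion", "Mal"),
    9: ("Lead", "Pb"),
    10: ("gfp Tracer", None),
    11: ("Ammonium", "NH4-N"),
}

# Reverse index so that either the full name or the abbreviation resolves to a code.
_CODE_BY_NAME = {}
for _code, (_name, _abbrev) in TOXIN_LABELS.items():
    _CODE_BY_NAME[_name.lower()] = _code
    if _abbrev is not None:
        _CODE_BY_NAME[_abbrev.lower()] = _code


def decode_label(code):
    """Return the toxin name for a numeric machine learning label."""
    if code not in TOXIN_LABELS:
        raise KeyError(f"unknown toxin label code: {code}")
    return TOXIN_LABELS[code][0]


def encode_label(name_or_abbrev):
    """Return the numeric label for a toxin name or abbreviation (case-insensitive)."""
    key = name_or_abbrev.strip().lower()
    if key not in _CODE_BY_NAME:
        raise KeyError(f"unknown toxin: {name_or_abbrev}")
    return _CODE_BY_NAME[key]


print(decode_label(7), encode_label("Pb"))
```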

Milestone 7.3: Quantification of the GFP threshold of detection with respect to the
concentration levels of the toxins: Completed in Q3, with updated results through Q4.
Based on the results of Milestone 7.1 (Table 7) and the ROC curves (Fig. 26), the
models establish the limits of detection that are described in Table 8. Toxin
detection results at various concentrations were determined by using a two-sample
T-test at a 0.05 significance level. A nonlinear support vector regressor was used,
with its metaparameters selected by 5-fold cross-validation.

Milestone 7.4: Construction of the Receiver Operating Characteristic curve (ROC) for
the sensors to minimize false negatives and false positives: Completed in Q3. The
results in Milestone 7.1 were obtained by assuming that the penalties of
misclassification are identical for positive and negative classes. In other words, the
cost of classifying a GFP signal as "toxic" when it is not (or vice versa) is the same.
However, it may be the case that the cost is not symmetric for positive and negative
cases. A water sensor is a good example of this situation, since it might be preferable
to ensure high accuracy when toxins are actually in the water (true positive rate, TP)
in exchange for increasing the number of cases that are classified as "toxin present"
when there is not any toxin in the water (false positive rate, FP). The Receiver
Operating Characteristic (ROC) curve is a 2-D parametrized curve used to quantify and
represent the trade-off between the true positive rate and the false positive rate of
a given classifier. The abscissa represents the false positive rate, while the
ordinate shows the true positive rate. Therefore, the optimal classifier is
represented by a point in the upper left corner of the ROC curve, since this point
corresponds to the best possible case in which the classifier is able to correctly
identify 100% of positive cases (toxin present) with no false alarms.

The  parameter  that  defines  the  ROC 
curve  in  our  classification  model  is  the  deci¬ 
sion  threshold,  which  determines  whether 
a  pattern  (GFP  signal)  is  classified  as  pos¬ 
itive  (toxin  present)  or  negative  (toxin  not 
present).  The  SVM  model  provides  a  value 
(decision  function)  for  each  pattern  that 
represents  the  confidence  of  the  model  in 
its  prediction,  and  the  final  classification  is 
obtained  by  assigning  to  the  negative  class 
those  points  with  decision  functions  that 
are  below  the  decision  threshold,  and  clas¬ 
sifying  as  positive  samples  those  patterns 
with  decision  functions  above  this  thresh¬ 
old.  Therefore,  by  sweeping  a  grid  of  pos¬ 
sible  values  for  the  SVM  decision  threshold, 
we  obtained  the  ROC  curves  for  the  various 
toxins  shown  in  Figure  26. 
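
The threshold sweep described above can be sketched in a few lines. The decision values and labels below are made up for illustration (they are not measured SVM outputs), and the trapezoidal area computed at the end is a standard summary statistic that the report itself does not quote.

```python
def roc_points(decision_values, labels, thresholds):
    """Return one (false_positive_rate, true_positive_rate) pair per decision threshold.

    A pattern is classified as positive (toxin present) when its decision value
    is above the threshold, and as negative otherwise.
    """
    n_pos = sum(1 for y in labels if y == 1)
    n_neg = len(labels) - n_pos
    points = []
    for thr in thresholds:
        tp = sum(1 for d, y in zip(decision_values, labels) if d > thr and y == 1)
        fp = sum(1 for d, y in zip(decision_values, labels) if d > thr and y == 0)
        points.append((fp / n_neg, tp / n_pos))
    return points


def area_under_curve(points):
    """Trapezoidal area under the ROC points, ordered by false positive rate."""
    pts = sorted(points)
    return sum((x1 - x0) * (y0 + y1) / 2 for (x0, y0), (x1, y1) in zip(pts, pts[1:]))


# Hypothetical decision values: four clean samples (0) and four toxin samples (1).
labels = [0, 0, 0, 0, 1, 1, 1, 1]
decision_values = [-1.5, -0.8, -0.3, 0.4, -0.1, 0.6, 1.2, 2.0]

# Grid of candidate thresholds from -2.0 to 2.5 in steps of 0.5.
thresholds = [-2.0 + 0.5 * i for i in range(10)]

curve = roc_points(decision_values, labels, thresholds)
for thr, (fpr, tpr) in zip(thresholds, curve):
    print(f"threshold {thr:+.1f}: FPR = {fpr:.2f}, TPR = {tpr:.2f}")
print(f"area under curve: {area_under_curve(curve):.5f}")
```

Lowering the threshold moves the operating point toward the upper right of the curve (more toxins caught, more false alarms), so a deployment that prioritizes catching contamination would pick a threshold below the symmetric-cost default.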

Milestone  8:  Use  the  models  obtained 
in  Milestone  7  to  investigate  whether 
the  combination  of  nonspecific  sensor  re¬ 
sponses  to  some  toxins  can  be  utilized  to 
improve  specificity 

Summary: We have successfully completed this milestone. We have demonstrated the high
specificity of the strains we have developed (Milestone 8.1), and we have used
state-of-the-art pattern recognition algorithms to improve the performance of the
classifier (Milestone 8.2).

Figure 26: Receiver Operating Characteristic (ROC) curves for the sensors. This parametrized curve
quantifies the trade-off between the true positive rate and the false positive rate of a classifier. An optimal
classifier is represented by a step function (e.g. arsenic), since this point corresponds to correct
identification of 100% of the positive cases (toxin present) with no false alarms. In the other limit, worst-case
classifiers are characterized by a diagonal ROC curve. Note that the arsenic, chromium(VI), copper, and
lead sensors are nearly perfect at all tested levels, while very low levels of cadmium, mercury, and
ammonium lead to suboptimal classification due to the detection limit of the genetic circuitry.

Milestone 8.1: Estimation of the number of nonspecific sensors required to achieve
maximum specificity in the discrimination of the target chemicals: Completed in Q4.
Because we used RNA-Seq to find promoters that were highly chemically specific, we
found there to be only a small amount of crosstalk. The combinatoric information
provided by nonspecific responses was used to strengthen classification performance.
To demonstrate this, we trained a classifier using a nonlinear support vector machine
on the GFP images concatenated to the estimations of the pixel derivatives for two
different time scales. In Table 9 we present the confusion matrix of the
sensor-specific strains to six different chemicals at various concentrations. The
derivation of this matrix includes all of the available concentrations in the
experiments, taking into account any non-specific responses of the sensor-specific
strains. Importantly, it demonstrates that the classifier can discriminate between the
toxins. That is, in each row, there is one maximum number (i.e., the sensor responding
to the correct toxin), and the rest of the numbers are uniformly low.

Generally, multi-class classification results nearing 80% represent a highly successful algorithm, particularly given the novelty of the devices and the data acquisition protocols. One issue contributing to negative results is the way in which we probed the sensors. We initially exposed the sensors to higher levels of toxins to verify that we could detect responses. However, once we established that we could sense these toxins well, we dropped down to much lower levels near the detection limit for the majority of the subsequent inductions. This weighted our data heavily toward the realm of low responses and low signal, skewing our results to contain more errors. As we have only begun to probe the parameter space, we expect to strengthen the confusion matrix significantly as we move forward.
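For readers unfamiliar with the format, a row-normalized confusion matrix of the kind we report can be tallied as in the following minimal sketch. The class names and labels below are hypothetical toy values, not our experimental results.

```python
def confusion_matrix_percent(true_labels, predicted_labels, classes):
    """Rows are the true class, columns the predicted class, entries in percent."""
    index = {name: i for i, name in enumerate(classes)}
    counts = [[0] * len(classes) for _ in classes]
    for truth, guess in zip(true_labels, predicted_labels):
        counts[index[truth]][index[guess]] += 1
    matrix = []
    for row in counts:
        total = sum(row)
        # Normalize each row so that it sums to 100% (or stays zero if empty).
        matrix.append([100.0 * c / total if total else 0.0 for c in row])
    return matrix


# Toy example with three classes (hypothetical labels).
classes = ["None", "Arsenic", "Lead"]
truth = ["None", "None", "Arsenic", "Arsenic", "Arsenic", "Arsenic", "Lead", "Lead"]
guess = ["None", "None", "Arsenic", "Arsenic", "Arsenic", "None", "Lead", "Arsenic"]
matrix = confusion_matrix_percent(truth, guess, classes)
```

Each row is read from left to right: the diagonal entry is the fraction of samples of that class identified correctly, and the off-diagonal entries show which classes it is confused with.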

              None     Arsenic  Cadmium  Chromium(VI)  Copper   Lead     Mercury
None          87.4%    2.0%     2.6%     1.6%          0.8%     2.6%     3.0%
Arsenic       6.2%     47.3%    7.4%     10.9%         9.3%     4.7%     14.3%
Cadmium       6.7%     6.2%     73.3%    1.3%          4.6%     2.8%     5.1%
Chromium(VI)  4.8%     10.6%    1.5%     74.9%         3.4%     3.4%     1.3%
Copper        2.1%     7.7%     4.5%     2.9%          76.5%    2.4%     3.9%
Lead          6.4%     2.9%     2.6%     2.7%          2.3%     81.3%    1.8%
Mercury       6.2%     10.1%    4.0%     0.9%          3.1%     1.5%     74.2%

Table 9: Confusion matrix solving a multiclass classification task for the following classes: baseline, Arsenic, Cadmium, Chromium(VI), Copper, Lead, and Mercury. Cobalt also performs well, but there is not enough data to provide proper statistics. The machine learning algorithm used to calculate this table is a calibrated multiclass support vector machine using 10-fold cross validation on the data [24]. This matrix should be read from left to right. For example, lead is confused with baseline activity 6.4% of the time at low concentrations, while it rarely gets confused with Mercury. The main diagonal indicates how often the algorithms provide the right answer (e.g. 81.3% of the time for lead).

We have performed a similar analysis of our recent data collected using the 18-strain chip. This chip contains the new ammonium strain, which provides for a new confusion matrix, as seen in Table 10. This is a confusion matrix generated by taking the average of ten non-overlapping partitions of the training set (80% of the data) and the test set (20% of the data). Every training set of a partition was cross-validated to obtain the optimal metaparameters. After the cross-validation, the models were applied to the test set. We find that ammonium can be discriminated from the rest of the toxins 83% of the time.

Milestone 8.2: Apply state of the art pattern recognition algorithms using the database built for Milestone 7 to improve performance of the detection: Completed in Q4. We have employed nonlinear support vector machine classification [24] on images that have undergone two types of transformation: 1) rotation and translation of the image with respect to a template for each of the sensor device setups (Fig. 27), and 2) an exponential moving average on the derivatives of the images to filter the pixel noise [25,26].

We tried two different algorithms for image rotation and translation: minimization of the scalar product of the image with the template reference image, and a normalized Euclidean distance from the bright field image to the template. We found the Euclidean distance to be the most effective method. The algorithm can run in real time to track the variations of the image's location in the device (Fig. 27). Once the images have been aligned, we can apply a feature extraction algorithm to zoom in on the traps where the strains are located.

Figure 27: Left: the template image used to align the images from all of the devices. Right: the result of applying the algorithm to an image.
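The idea behind distance-based alignment can be sketched as follows. This is a simplified illustration, not our production code: it searches over translations only (rotation is omitted), it assumes that "normalized" means each image is scaled to unit norm before the distance is taken, and the tiny images are hypothetical.

```python
import math


def normalized_distance(patch, template):
    """Euclidean distance between two equally sized images after each is
    scaled to unit norm, so that overall brightness does not dominate."""
    a = [v for row in patch for v in row]
    b = [v for row in template for v in row]
    norm_a = math.sqrt(sum(v * v for v in a)) or 1.0  # avoid division by zero
    norm_b = math.sqrt(sum(v * v for v in b)) or 1.0
    return math.sqrt(sum((x / norm_a - y / norm_b) ** 2 for x, y in zip(a, b)))


def best_offset(image, template):
    """Exhaustively search for the (row, col) offset where the template fits best."""
    t_rows, t_cols = len(template), len(template[0])
    best = None
    for r in range(len(image) - t_rows + 1):
        for c in range(len(image[0]) - t_cols + 1):
            patch = [row[c:c + t_cols] for row in image[r:r + t_rows]]
            d = normalized_distance(patch, template)
            if best is None or d < best[0]:
                best = (d, r, c)
    return best[1], best[2]


# Hypothetical 3 x 3 template pasted into a 6 x 6 frame at row 2, column 1.
template = [[0, 9, 0],
            [9, 9, 9],
            [0, 9, 0]]
image = [[0] * 6 for _ in range(6)]
for r in range(3):
    for c in range(3):
        image[r + 2][c + 1] = template[r][c]
offset = best_offset(image, template)
```

The recovered offset is the translation that must be undone to register the frame against the template; a practical implementation would also search over rotation angles and use a faster search than this exhaustive scan.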

              None     Arsenic  Cadmium  Chromium(VI)  Cobalt   Copper   Lead     Ammonium
None          45.02%   7.66%    0.75%    13.48%        9.69%    2.56%    5.26%    15.64%
Arsenic       3.50%    92.58%   0.00%    0.14%         0.00%    0.00%    0.00%    3.78%
Cadmium       4.72%    0.00%    93.11%   2.17%         0.00%    0.00%    0.00%    0.00%
Chromium(VI)  8.05%    0.18%    0.25%    90.43%        0.11%    0.11%    0.52%    0.35%
Cobalt        9.75%    0.00%    0.00%    0.21%         89.32%   0.49%    0.00%    0.22%
Copper        15.73%   0.00%    0.00%    1.12%         2.75%    78.27%   0.00%    2.14%
Lead          7.71%    0.00%    0.00%    1.22%         0.00%    0.00%    90.76%   0.31%
Ammonium      10.94%   4.99%    0.00%    0.35%         0.12%    0.19%    0.12%    83.30%

Table 10: Confusion matrix obtained from the 18-strain data.

Figure 28: An example of the results of the image transformation after applying the alignment, compression, and feature extraction on the GFP signal. This is the input signal that is provided to the classifier. The left square is the original image, and the other three squares show the changes made by the image processing techniques.

The feature extraction algorithm involves two phases: image reduction and derivative calculation. The image reduction phase applies a smoothing algorithm on neighboring pixels according to a circular kernel using a radius of 41 pixels. The weights provided to the kernel have the maximum value in the center of the circle and decrease to a minimum value at the border of the circle. After the smoothing has been applied, the image is scaled from 720 x 640 pixels to 45 x 40 pixels. This operation reduces the size of the feature space significantly.
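The image reduction step can be sketched as below. Going from 720 x 640 to 45 x 40 pixels corresponds to a reduction factor of 16 along each axis. The linear weight profile is an assumption made for this sketch (the text above only states that weights are largest at the center and smallest at the border), and the demo uses a small hypothetical image and radius so that it runs quickly.

```python
import math


def circular_kernel(radius):
    """Weights that peak at the center and fall linearly to a small positive
    minimum at the border. The linear profile is an assumption of this sketch."""
    weights = {}
    for dr in range(-radius, radius + 1):
        for dc in range(-radius, radius + 1):
            dist = math.hypot(dr, dc)
            if dist <= radius:
                weights[(dr, dc)] = 1.0 - dist / (radius + 1)
    return weights


def reduce_image(image, factor, radius):
    """Smooth around the center of each factor x factor block, one value per block."""
    rows, cols = len(image), len(image[0])
    kernel = circular_kernel(radius)
    reduced = []
    for br in range(rows // factor):
        out_row = []
        for bc in range(cols // factor):
            cr, cc = br * factor + factor // 2, bc * factor + factor // 2
            total = weight_sum = 0.0
            for (dr, dc), w in kernel.items():
                r, c = cr + dr, cc + dc
                if 0 <= r < rows and 0 <= c < cols:  # ignore pixels off the image
                    total += w * image[r][c]
                    weight_sum += w
            out_row.append(total / weight_sum)
        reduced.append(out_row)
    return reduced


# Small stand-in for a full frame: a 32 x 48 image reduced by a factor of 16.
flat = [[5.0] * 48 for _ in range(32)]
small = reduce_image(flat, factor=16, radius=4)
```

With `factor=16` and `radius=41` the same routine would map a 720 x 640 frame to 45 x 40 values, although a pure-Python loop of this kind is far too slow for real-time use.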

The feature extraction algorithm next calculates the derivatives of the images to enhance the dynamic changes in the chip. The derivative estimate is calculated as $\Delta p^{k}_{ij}(t) = \alpha_k\, p_{ij}(t) + (1 - \alpha_k)\,\bigl(1 - p^{k}_{ij}(t-1)\bigr)$, and the time scales used in the filter are $\alpha_1 = 1/11$, $\alpha_2 = 1/26$, and $\alpha_3 = 1/51$. In the end, the total number of features is (45x40)x4. In Figure 28 we show the original GFP image and the result of the derivative estimation using three time scales ($\alpha_1$, $\alpha_2$, $\alpha_3$).
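To make the feature count concrete, the sketch below builds a (45x40)x4 feature vector from a short sequence of reduced frames. It uses the textbook exponential moving average recursion y(t) = alpha * x(t) + (1 - alpha) * y(t - 1) and takes the deviation of the current frame from each filtered history as the change estimate. Both choices are assumptions made for illustration and are not necessarily identical to the filter used in our software; the frames are synthetic.

```python
def ema_filter(frames, alpha):
    """Exponentially weighted moving average of a pixel time series,
    y(t) = alpha * x(t) + (1 - alpha) * y(t - 1), started from the first frame."""
    smoothed = [list(frames[0])]
    for frame in frames[1:]:
        prev = smoothed[-1]
        smoothed.append([alpha * x + (1.0 - alpha) * y for x, y in zip(frame, prev)])
    return smoothed


def feature_vector(frames, alphas=(1 / 11, 1 / 26, 1 / 51)):
    """Concatenate the latest frame with its deviation from each filtered history."""
    current = list(frames[-1])
    features = list(current)
    for alpha in alphas:
        baseline = ema_filter(frames, alpha)[-1]
        features.extend(x - b for x, b in zip(current, baseline))
    return features


# Synthetic data: five flattened 45 x 40 frames whose brightness rises steadily.
n_pixels = 45 * 40
frames = [[float(t)] * n_pixels for t in range(5)]
features = feature_vector(frames)
```

The vector holds one block for the current image and one block per time scale, which gives 1800 x 4 = 7200 features per sample.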

After the feature extraction algorithm has been applied, the classifier is trained using 80% of the data and tested using the remaining 20%. We apply a 10-fold cross-validation on the 80% of the training data to determine the optimal model parameters before we test the model on the remaining 20%. The confusion matrices are then calculated by running an average on ten partitions of train and test. For the 8-strain chip data, the cross-validated classification accuracy obtained is 76% with a standard deviation of 2.57%. The results are summarized in the confusion matrix (Table 9). Arsenic is the toxin that has the highest likelihood of being confused with other toxins. This is due to collecting a majority of our data at very low arsenic levels to test the limits of the sensor, which skews the results.
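The evaluation protocol described above can be sketched as follows. To keep the example self-contained, a one-dimensional k-nearest-neighbor classifier stands in for the support vector machine, random 80/20 splits stand in for our partitions, and the data are synthetic; all names and values here are assumptions for illustration.

```python
import random


def knn_predict(train, x, k):
    """Majority vote among the k nearest 1-D training points (stand-in classifier)."""
    nearest = sorted(train, key=lambda item: abs(item[0] - x))[:k]
    votes = [label for _, label in nearest]
    return max(set(votes), key=votes.count)


def accuracy(train, test, k):
    hits = sum(1 for x, y in test if knn_predict(train, x, k) == y)
    return hits / len(test)


def cross_validate(train, k, folds=10):
    """Mean validation accuracy over disjoint folds of the training set."""
    scores = []
    for f in range(folds):
        held_out = train[f::folds]
        kept = [item for i, item in enumerate(train) if i % folds != f]
        scores.append(accuracy(kept, held_out, k))
    return sum(scores) / folds


def evaluate(data, candidate_ks=(1, 3, 5), partitions=10, seed=0):
    """Average test accuracy over several 80/20 train/test partitions."""
    rng = random.Random(seed)
    test_scores = []
    for _ in range(partitions):
        shuffled = data[:]
        rng.shuffle(shuffled)
        cut = int(0.8 * len(shuffled))
        train, test = shuffled[:cut], shuffled[cut:]
        # Choose the metaparameter on the training portion only.
        best_k = max(candidate_ks, key=lambda k: cross_validate(train, k))
        test_scores.append(accuracy(train, test, best_k))
    return sum(test_scores) / partitions


# Synthetic, well-separated data: class 0 near 0, class 1 near 10.
data = [(i * 0.1, 0) for i in range(50)] + [(10 + i * 0.1, 1) for i in range(50)]
mean_accuracy = evaluate(data)
```

The important design point is that the held-out 20% is never used when the metaparameter is selected, so the averaged test accuracy is not biased by the tuning step.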

We have been continuously collecting data using five replicates of the sensor undergoing different alignment and feature extraction algorithms. Overall, we are extremely satisfied with the performance of this state-of-the-art classifier on this dataset, particularly given the limited amount of data collected thus far. Moreover, we have been intentionally probing the strains with concentration levels that are difficult to detect and classify. As we continue to collect data with all 18 strains, the classifier will improve further.

Milestone 9: Develop the controller board to carry out sophisticated analysis of complex data

Summary: We have successfully completed this milestone. A PandaBoard system on a chip (SoC) with integrated Wi-Fi was selected as the primary control system (Milestone 9.1). We have developed advanced image processing algorithms and embedded them on the PandaBoard (Milestone 9.2). We have developed algorithms which can easily be stored and loaded on our PandaBoard SoC (Milestone 9.3).

Milestone 9.1: Design the controller board with wireless capability: Completed in Q2. We tested two low-power platforms based on the Texas Instruments ARM processor with wireless capability: one based on the Sitara ARM and the other based on the Cortex-A9. We have opted for the more powerful Cortex-A9 due to the ease of use and low power consumption. We are currently using a PandaBoard, which is powered by a Texas Instruments OMAP4430 system on a chip (SoC) device. The OMAP4430 chipset contains a dual-core 1 GHz ARM Cortex-A9 MPCore CPU with 1 GB of DDR2 SDRAM, Wi-Fi capability, and an SD card slot offering up to 32 GB of storage. The electronics are similar to a modern smartphone in terms of processing power and power consumption. The PandaBoard solution allows us to install a Linux operating system so that we can use standard GNU compilers and run our software without any major modifications, which we demonstrate in the next two aims.

Milestone 9.2: Reduce the size of the pattern recognition algorithms to be able to be embedded in the PandaBoard. Completed in Q4. Fluorescence images are used to train our classifier to be able to detect and discriminate between different toxins. In order to speed up the operation of the classifier, we must reduce its computational cost, which is directly linked to the number of images and the number of features in each image. Each image consists of a set of numerical features, each containing the intensity of a pixel in the image. Fortunately, the images contain many irrelevant features and regions as well as a large number of redundant features in neighboring pixels. Therefore, it is extremely useful to apply a feature selection algorithm to find the most informative features and to reduce the computational cost of the classifier. In Milestone 8.2, we describe the image processing and feature selection process that enabled us to reduce computational cost and embed our algorithms in the PandaBoard.

Milestone 9.3: Embed the algorithms in the microcontroller. Completed in Q4. We have compiled and executed nonlinear support vector machines [24] on the images described in Milestone 9.2 after the feature selection process. We compiled the software, trained the model, and ran the trained model without any problems.

The algorithms were developed using the OpenMP API and compiled using GNU g++. Both are well established and stable options that run well on the PandaBoards. We implemented aggressive compiler optimizations to produce native code that runs fast on the multicore Cortex-A9 processors. Resulting performance was more than sufficient to allow real-time operation, with our embedded algorithms proving capable of classifying each image within 0.05 seconds.

Updates  for  This  Reporting  Period 

Meetings/Conferences/Workshops

Hasty,  J.  (2014,  June  4).  Attended  the 
DARPA  Living  Foundries  meeting,  Denver, 
CO. 

Hasty, J., Huerta, R., & Tsimring, L. S. (2014, July 14). An online biosensor for the protection of water supplies. Presented at the Synthetic Biology: Engineering, Evolution & Design (SEED) conference, Manhattan Beach, CA.

Subject  Invention  Disclosures 

UCSD Docket No. SD2015-188: An Online Biosensor for the Protection of Water Supplies.

Subject Invention Patent Applications
None at this time.


Milestone  Schedule  Status 

All  tasks  have  been  completed  on  schedule. 
Below  find  a  Gantt  chart  showing  the  final 
task  completion  schedule. 
Detailed Methods


Milestone 1.2: Clone the promoters into an expression plasmid driving the production of GFP: Synthetic constructs were transformed into both standard E. coli MG1655 and optimized E. coli LABEC01 cells for expression analysis. By serially passing MG1655 in M9 medium through several generations, we evolved the common MG1655 lab strain, which has been adapted for growth in rich lysogeny broth (LB) medium, into a strain well-suited to growth in M9 minimal medium within our microfluidic devices. The M9-adapted strain, which we named LABEC01, exhibited multiple phenotypic changes. The growth rate in minimal medium increased by approximately 10%, colonies on minimal medium plates were observed to be smoother as compared to the parent MG1655, and the bacteria aggregated less when grown in microfluidic devices. To investigate how these adaptations affect the cellular response to toxins and the activity of sensitive promoters, we also exposed strain LABEC01 to the full set of toxins for RNA-Seq analysis.
Appendix A


Candidate  toxin-responsive  promoters 


Toxin         Gene/Promoter    Source                    RBS        Host Strain/Plasmid   Concentration Sensed in Microfluidic Device (µM)  SNR after 6 h
Ammonium      PnasA            B. subtilis genome        native
Ammonium      PnasA            B. subtilis genome        synthetic
Ammonium      PnasB            B. subtilis genome        native
Ammonium      PnasB            B. subtilis genome        synthetic
Ammonium      Pspo1-tnrA1      B. subtilis genome        synthetic
Ammonium      Pspo1-tnrA2      B. subtilis genome        synthetic
Arsenic       arsR/ParsR       E. coli plasmid           native     E. coli MG1655/As1    0.13                                              20
Arsenic       arsR/ParsR       E. coli genome            synthetic  E. coli LABEC01/As3
Arsenic       arsR/ParsR       E. coli genome            synthetic  E. coli MG1655/As3    0.13                                              33
Arsenic       arsR/ParsR       S. aureus plasmid         native     E. coli MG1655/As5
Arsenic       ParsR            E. coli RNA-Seq
Cadmium       cadC/PcadC       S. aureus plasmid         native     E. coli MG1655/Cd1    0.04                                              17
Cadmium       cadC/PcadC       S. aureus plasmid         synthetic  E. coli MG1655/Cd2
Cadmium       cadR/PcadR       P. putida genome          native     E. coli MG1655/Cd4
Cadmium       cadR/PcadR       P. putida genome          synthetic  E. coli MG1655/Cd3
Cadmium       PzntA            E. coli RNA-Seq
Chromium(VI)  chrB/PchrB       C. metallidurans plasmid  native     E. coli MG1655/Cr3
Chromium(VI)  chrB/PchrB       C. metallidurans plasmid  synthetic  E. coli MG1655/Cr2
Chromium(VI)  chrB/PchrB       O. tritici transposon     native     E. coli LABEC01/Cr5   5                                                 5
Chromium(VI)  chrB/PchrB       O. tritici transposon     native     E. coli MG1655/Cr5
Chromium(VI)  chrB/PchrB       O. tritici transposon     synthetic  E. coli LABEC01/Cr4
Chromium(VI)  chrB/PchrB       O. tritici transposon     synthetic  E. coli MG1655/Cr4
Chromium(VI)  chrS/PchrS       B. subtilis genome        synthetic  E. coli MG1655/Cr1
Chromium(VI)  PrecN            E. coli RNA-Seq
Chromium(VI)  PsulA            E. coli RNA-Seq
Chromium(VI)  PumuD            E. coli RNA-Seq
Cobalt        PdadA            E. coli RNA-Seq
Cobalt        Phmp             E. coli RNA-Seq
Cobalt        PilvB            E. coli RNA-Seq
Cobalt        PlipA            E. coli RNA-Seq
Cobalt        PmmuP            E. coli RNA-Seq           native     E. coli MG1655/Co7
Cobalt        PmmuP            E. coli RNA-Seq           synthetic  E. coli MG1655/Co8
Cobalt        nmtR/PnmtR       M. tuberculosis genome    native     E. coli MG1655/Co1
Cobalt        nmtR/PnmtR       M. tuberculosis genome    synthetic  E. coli MG1655/Co2
Cobalt        PsoxR            E. coli RNA-Seq
Cobalt        PtehA            E. coli RNA-Seq
Cobalt        PygbA            E. coli RNA-Seq           native     E. coli MG1655/Co3
Cobalt        PygbA            E. coli RNA-Seq           synthetic  E. coli MG1655/Co4
Cobalt        PyjbJ            E. coli RNA-Seq           native     E. coli MG1655/Co5
Cobalt        PyjbJ            E. coli RNA-Seq           synthetic  E. coli MG1655/Co6
Cobalt        PyqfA            E. coli RNA-Seq
Copper        cueR/PcopA       E. coli genome            native     E. coli MG1655/Cu1    25                                                65
Copper        (cusS/R)/PcusC   E. coli genome            native     E. coli MG1655/Cu2
Copper        PcusC            E. coli RNA-Seq
Copper        PcusR            E. coli RNA-Seq
Lead          pbrR/PpbrR       C. metallidurans plasmid  native     E. coli LABEC01/Pb1
Lead          pbrR/PpbrR       C. metallidurans plasmid  native     E. coli MG1655/Pb1
Lead          pbrR/PpbrR       C. metallidurans plasmid  synthetic  E. coli LABEC01/Pb2   7                                                 18
Lead          pbrR/PpbrR       C. metallidurans plasmid  synthetic  E. coli MG1655/Pb2
Lead          PmntH            E. coli RNA-Seq
Lead          PshiA            E. coli RNA-Seq
Lead          Pybil            E. coli RNA-Seq
Lead          Pyp              E. coli RNA-Seq
Malathion     PcusC            E. coli RNA-Seq
Malathion     PnemR            E. coli RNA-Seq
Mercury       merR/PmerR       E. coli plasmid           native     E. coli MG1655/Hg4
Mercury       merR/PmerR       E. coli plasmid           synthetic  E. coli MG1655/Hg3    0.1                                               20
Mercury       merR/PmerR       S. aureus plasmid         native     E. coli MG1655/Hg2
Mercury       merR/PmerR       S. aureus plasmid         synthetic  E. coli MG1655/Hg1
Mercury       merR/PmerR       S. marcescens plasmid     native     E. coli MG1655/Hg6
Mercury       merR/PmerR       S. marcescens plasmid     synthetic  E. coli MG1655/Hg5
Zinc          PmntH            E. coli RNA-Seq
Zinc          PshiA            E. coli RNA-Seq
Zinc          Pyjjz            E. coli RNA-Seq
Zinc          PzntA            E. coli RNA-Seq
Zinc          PzraP            E. coli RNA-Seq

Table  11:  All  candidate  toxin-responsive  promoters  identified  in  this  work,  ordered  by  the  toxin  of  expected 
sensitivity.  In  the  case  of  promoters  identified  by  RNA-Seq,  the  gene  is  unknown.  For  promoters  that 
have  been  expressed  in  a  synthetic  construct,  the  selected  RBS  and  host  strain  are  shown.  If  this  synthetic 
construct  has  been  used  to  sense  the  toxin  within  a  microfluidic  device,  the  concentration  sensed  and  SNR 
after  6  h  are  shown. 
Appendix B

RNA-Seq  results  for  promoter  activation  in  E.  coli  MG1655  in  response  to  single  and  multiple 
toxin  exposures  at  low  and  high  concentrations: 

Our analysis of the sequencing data from RNA-Seq experiments to determine candidate genes that are induced upon toxin exposure consisted of three main tasks: sequence alignment, quantification of gene expression, and identification of differentially expressed genes.

Sequence alignment: Reads were aligned to the reference E. coli K-12 substr. MG1655 genome using a tolerance of at most two mismatches per alignment to protect against sequencing errors. The alignment was performed using Bowtie software [27], which is known to be very efficient in aligning reads to a reference genome.

Quantification of gene expression: The expression level of each gene was determined as a function of the number of aligned reads mapping to the gene. After analyzing several approaches adopted in the literature to tabulate the number of reads mapping to each gene, we implemented our own software capable of reproducing the counting algorithms behind some of the standard toolboxes such as Bedtools [28] and HTSeq [29]. In particular, we counted the number of reads mapping to each gene regardless of whether the read mapped to several genes, taking into account the strand-specificity of each read. Additionally, we implemented our own algorithms for sequence alignment and quantification of gene expression in order to crosscheck all results.
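The counting rule described above can be illustrated with the following minimal sketch. The gene names, coordinates, and reads are hypothetical, and the overlap criterion (any overlap on the same strand) is an assumption made for illustration rather than a description of our software.

```python
from collections import Counter


def count_reads(reads, genes):
    """Count reads per gene. A read is credited to every gene it overlaps on
    the same strand, so a read spanning two genes is counted for both."""
    counts = Counter({name: 0 for name in genes})
    for start, end, strand in reads:
        for name, (g_start, g_end, g_strand) in genes.items():
            overlaps = start <= g_end and end >= g_start
            if overlaps and strand == g_strand:
                counts[name] += 1
    return counts


# Hypothetical coordinates (1-based, inclusive), for illustration only.
genes = {
    "geneA": (100, 500, "+"),
    "geneB": (450, 900, "+"),
    "geneC": (300, 700, "-"),
}
reads = [
    (120, 170, "+"),   # inside geneA only
    (460, 510, "+"),   # overlaps geneA and geneB, counted for both
    (460, 510, "-"),   # same position, opposite strand: geneC only
    (950, 1000, "+"),  # outside every gene
]
counts = count_reads(reads, genes)
```

Here the second read contributes to both overlapping genes on the forward strand, while the third read, at the same position on the reverse strand, contributes only to the reverse-strand gene.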

Identification  of  differentially  expressed  genes:  Finally,  a  set  of  statistical  and  information 
theory  algorithms  were  applied  in  order  to  extract  not  only  differentially  expressed  (DE)  genes  for 
each  toxin  with  respect  to  the  control  samples  (pure  water)  but  also  toxin-specific  genes. 

DESeq is a standard tool for identifying DE genes that allowed us to select sensitive genes with differential expression between the control samples (pure water) and the cells exposed to toxin. It assumes that the number of counts for each gene across experimental replicates follows a negative binomial distribution [30,31]. We considered genes with a False Discovery Rate (FDR) lower than 1% as DE in order to ensure statistically robust DE genes. We note that some genes showed high variability in the control samples across different batches of RNA-Seq experiments, indicating that these genes are very sensitive to environmental conditions. We identified 846 of these genes by performing a DESeq differential analysis (FDR<1%) between the control samples in different batches and subsequently removed them from the candidate pool. The number of DE genes (FDR<1%) identified for each condition when compared to the negative samples in the same batch and after removing genes that are DE between control samples is given in Table 6.
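As a reminder of what an FDR threshold means in practice, the sketch below adjusts a list of per-gene p-values with the Benjamini-Hochberg procedure, a common way to control the FDR, and keeps the genes below 1%. The p-values are hypothetical, and the code is a stand-alone illustration rather than a reproduction of the DESeq pipeline.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotone adjusted values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical per-gene p-values.
p_values = [0.0001, 0.006, 0.03, 0.2, 0.9]
q_values = benjamini_hochberg(p_values)
de_genes = [i for i, q in enumerate(q_values) if q < 0.01]
```

Only the first gene survives the 1% cut in this toy example, since the second gene's adjusted value rises to 0.015 once the number of tests is taken into account.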

Ideally, good candidate specific genes are those with a significant fold-change with respect to the control samples but with a negligible fold-change with respect to the other toxins. Additionally, genes with the largest number of counts and expression levels are preferable in order to maximize the signal-to-noise ratio. When it is not possible to find toxin-specific genes, the next generation of good candidates is formed by those genes satisfying the above properties for a small subset of toxins (multiple-toxin response). It is desirable to have single-toxin-specific genes for several of the toxins in the combination in order to determine toxin-specific multi-gene-responses by means of logical operations.

In rare cases, shared genes are differentially expressed. Therefore, we have developed information-theoretic measures to improve the toxin separability. The core idea of the approach is that low entropies (or highly informative genes) correspond to toxin-specific genes, while large entropies (low information) are associated with scenarios in which DE fold-changes across different toxins are similar and should be discarded. The result of the analysis shows that toxins can easily be discriminated by using simple boolean rules as shown in the main report.
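One simple way to express this idea is sketched below: the absolute fold changes of a gene across toxins are normalized into a distribution, and its Shannon entropy is used as a specificity score. This particular normalization is an assumption made for illustration and is not necessarily the measure we implemented; the fold-change values are hypothetical.

```python
import math


def response_entropy(fold_changes):
    """Shannon entropy (in bits) of a gene's response profile across toxins.
    Absolute fold changes are normalized to sum to one before the entropy."""
    magnitudes = [abs(v) for v in fold_changes.values()]
    total = sum(magnitudes)
    if total == 0:
        return math.log2(len(magnitudes))  # no response anywhere: uninformative
    probs = [m / total for m in magnitudes if m > 0]
    return sum(p * math.log2(1.0 / p) for p in probs)


# Hypothetical log fold changes for two genes.
specific = {"cobalt": 4.0, "copper": 0.0, "lead": 0.0, "zinc": 0.0}
shared = {"cobalt": 1.0, "copper": 1.0, "lead": 1.0, "zinc": 1.0}
specific_entropy = response_entropy(specific)
shared_entropy = response_entropy(shared)
```

A gene that responds to a single toxin scores zero bits, while a gene that responds equally to four toxins scores the maximum of two bits and would be discarded as a specific sensor.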


Cadmium and Zinc sensing gene zntA (E. coli str. K-12 substr. MG1655)

Figure 29: Response of the zntA promoter in E. coli MG1655 for sensing cadmium and zinc. Note that the zntA promoter responds monotonically to increasing concentrations of cadmium alone and in combination with other heavy metals (rightmost conditions) without exhibiting crosstalk. A cadmium-specific sensor can be implemented by combining this response with the zinc-specific response of zraP using the boolean expression (zntA) ∧ (¬zraP).
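A boolean rule of this kind can be written down directly, as in the short sketch below. The induction threshold and the fold-change values are hypothetical and chosen only to illustrate the logic.

```python
def is_induced(fold_change, threshold=2.0):
    """Treat a promoter as ON when its fold change reaches the threshold.
    The threshold value is a hypothetical choice for this sketch."""
    return fold_change >= threshold


def cadmium_call(znta_fold, zrap_fold, threshold=2.0):
    """Cadmium-specific rule: (zntA) AND (NOT zraP)."""
    return is_induced(znta_fold, threshold) and not is_induced(zrap_fold, threshold)


cadmium_only = cadmium_call(znta_fold=5.0, zrap_fold=1.0)   # zntA on, zraP off
zinc_present = cadmium_call(znta_fold=5.0, zrap_fold=6.0)   # both on
no_toxin = cadmium_call(znta_fold=1.0, zrap_fold=1.0)       # both off
```

The rule reports cadmium only when zntA is induced without the zinc-specific zraP response, so a zinc exposure that activates both promoters is not mistaken for cadmium.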
Chromium(VI) sensing gene recN (E. coli str. K-12 substr. MG1655)

Figure 30: Response of the recN promoter in E. coli MG1655 for specifically sensing chromium(VI). Note that the response of the recN promoter is specific to chromium(VI) alone but only shows sensitivity to high concentrations of chromium(VI).
Chromium(VI) sensing gene sulA (E. coli str. K-12 substr. MG1655)

[Plot: response versus experimental condition. Conditions: AsLow, AsHigh, Co, NiLow, NiHigh, FeLow, FeHigh, ZnLow, ZnHigh, CrLow, CrHigh, CdLow, CdHigh, HgLow, HgHigh, CuLow, CuHigh, PbLow, PbHigh, MalLow, MalHigh, Hg+Cd, As+Hg, As+Cd]

Figure 31: Response of the sulA promoter in E. coli MG1655 for specifically sensing chromium(VI). Note that the response of the sulA promoter is specific to chromium(VI) alone but only shows sensitivity to high concentrations of chromium(VI).
Chromium(VI) sensing gene umuD (E. coli str. K-12 substr. MG1655)

Figure 32: Response of the umuD promoter in E. coli MG1655 for specifically sensing chromium(VI). Note that the response of the umuD promoter is specific to chromium(VI) alone but only shows sensitivity to high concentrations of chromium(VI).
Cobalt sensing gene dadA (E. coli str. K-12 substr. MG1655)

[Plot panels: mean fold change, normalized counts, and % CDF expression level]

Figure 33: Response of the dadA promoter in E. coli MG1655 for specifically sensing cobalt. Note that the response of the dadA promoter is specific to cobalt alone.
Cobalt sensing gene hmp (E. coli str. K-12 substr. MG1655)

[Plot panels: mean fold change, normalized counts, and % CDF expression level]

Figure 34: Response of the hmp promoter in E. coli MG1655 for specifically sensing cobalt. Note that the response of the hmp promoter is specific to cobalt alone.
Cobalt sensing gene ilvB (E. coli str. K-12 substr. MG1655)

[Plot panels: mean fold change, normalized counts, and % CDF expression level]

Figure 35: Response of the ilvB promoter in E. coli MG1655 for specifically sensing cobalt. Note that the response of the ilvB promoter is specific to cobalt alone.
Cobalt sensing gene lipA (E. coli str. K-12 substr. MG1655)

[Plot panels: mean fold change, normalized counts, and % CDF expression level, each versus experimental condition. Conditions: AsLow, AsHigh, Co, NiLow, NiHigh, FeLow, FeHigh, ZnLow, ZnHigh, CrLow, CrHigh, CdLow, CdHigh, HgLow, HgHigh, CuLow, CuHigh, PbLow, PbHigh, MalLow, MalHigh, Hg+Cd, As+Hg, As+Cd]

Figure 36: Response of the lipA promoter in E. coli MG1655 for specifically sensing cobalt. Note that the response of the lipA promoter is specific to cobalt alone.
Cobalt sensing gene mmuP (E. coli str. K-12 substr. MG1655)

[Plot panels: mean fold change, normalized counts, and % CDF expression level]

Figure 37: Response of the mmuP promoter in E. coli MG1655 for specifically sensing cobalt. Note that the response of the mmuP promoter is specific to cobalt alone.
Cobalt sensing gene soxR (E. coli str. K-12 substr. MG1655)

[Plot panels: mean fold change, normalized counts, and % CDF expression level]

Figure 38: Response of the soxR promoter in E. coli MG1655 for specifically sensing cobalt. Note that the response of the soxR promoter is specific to cobalt alone.
Cobalt sensing gene tehA (E. coli str. K-12 substr. MG1655)

[Plot panels: mean fold change, normalized counts, and % CDF expression level]

Figure 39: Response of the tehA promoter in E. coli MG1655 for specifically sensing cobalt. Note that the response of the tehA promoter is specific to cobalt alone.
Cobalt sensing gene ygbA (E. coli str. K-12 substr. MG1655)

[Plot panels: mean fold change, normalized counts, and % CDF expression level]

Figure 40: Response of the ygbA promoter in E. coli MG1655 for specifically sensing cobalt. Note that the response of the ygbA promoter is specific to cobalt alone.
Cobalt sensing gene yjbJ (E. coli str. K-12 substr. MG1655)

[Plot panels: mean fold change, normalized counts, and % CDF expression level]

Figure 41: Response of the yjbJ promoter in E. coli MG1655 for specifically sensing cobalt. Note that the response of the yjbJ promoter is specific to cobalt alone.
Cobalt sensing gene yqfA (E. coli str. K-12 substr. MG1655)
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Figure  42:  Response  of  the  yqfA  promoter  in  E.  coli  MG1655  for  specifically  sensing  cobalt.  Note  that  the 
response  of  the  yqfA  promoter  is  specific  to  cobalt  alone. 
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Copper  and  malathion  sensing  gene  cusC  (E.  coli  str.  K-12  substr.  MG1655) 
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Figure  43:  Response  of  the  cusC  promoter  in  E.  coli  MG1655  for  sensing  copper  and  malathion.  Note  that 
the cusC promoter responds monotonically to increasing concentrations of copper alone and to low concentrations
of malathion alone. A copper-specific sensor can be implemented by combining this response with
the malathion-specific response of nemR using the Boolean expression (cusC) ∧ (¬nemR). Alternatively, a
malathion-specific sensor can be implemented by combining this response with the copper-specific response
of cusR using the Boolean expression (cusC) ∧ (¬cusR).
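The two Boolean expressions in the Figure 43 caption can be illustrated with a minimal Python sketch. It assumes each promoter's mean fold change has already been measured and can be reduced to an on/off call. The cutoff ON_THRESHOLD, the helper names, and the example fold-change values are hypothetical and do not come from the report.

```python
from typing import Dict

# Hypothetical fold-change cutoff for calling a promoter "on" (an assumption,
# not a value taken from the report).
ON_THRESHOLD = 2.0


def is_on(fold_change: float, threshold: float = ON_THRESHOLD) -> bool:
    """Reduce a mean fold change to an on/off promoter call."""
    return fold_change >= threshold


def classify_copper_malathion(fold_changes: Dict[str, float]) -> Dict[str, bool]:
    """Apply the two Boolean expressions from the Figure 43 caption."""
    cusC = is_on(fold_changes["cusC"])
    cusR = is_on(fold_changes["cusR"])
    nemR = is_on(fold_changes["nemR"])
    return {
        "copper": cusC and not nemR,     # (cusC) AND (NOT nemR)
        "malathion": cusC and not cusR,  # (cusC) AND (NOT cusR)
    }


if __name__ == "__main__":
    # Illustrative values only: cusC and cusR induced, nemR at baseline.
    print(classify_copper_malathion({"cusC": 5.0, "cusR": 4.0, "nemR": 1.0}))
```

Because cusR responds only to high copper (Figure 44) and nemR only to high malathion (Figure 49), the on/off calls, and therefore the outputs, depend on the concentration range being tested.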
Copper sensing gene cusR (E. coli str. K-12 substr. MG1655)


[Figure legend, experimental conditions: AsLow, AsHigh, Co, NiLow, NiHigh, FeLow, FeHigh, ZnLow, ZnHigh, CrLow, CrHigh, CdLow, CdHigh, HgLow, HgHigh, CuLow, CuHigh, PbLow, PbHigh, MalLow, MalHigh, Hg+Cd, As+Hg, As+Cd]


Figure  44:  Response  of  the  cusR  promoter  in  E.  coli  MG1655  for  specifically  sensing  copper.  Note  that  the 
response of the cusR promoter is specific to copper alone but only shows sensitivity to high concentrations
of  copper. 
Lead and zinc sensing gene mntH (E. coli str. K-12 substr. MG1655)
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Figure  45:  Response  of  the  mntH  promoter  in  E.  coli  MG1655  for  sensing  lead  and  zinc.  Note  that  the  mntH 
promoter responds to high concentrations of lead alone and high concentrations of zinc alone. A lead-specific
sensor can be implemented by combining this response with the zinc-specific response of zraP using
the Boolean expression (mntH) ∧ (¬zraP). Alternatively, a zinc-specific sensor can be implemented by combining
this response with the lead-specific response of ybiI using the Boolean expression (mntH) ∧ (¬ybiI).
Lead and zinc sensing gene shiA (E. coli str. K-12 substr. MG1655)
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Figure  46:  Response  of  the  shiA  promoter  in  E.  coli  MG1655  for  sensing  lead  and  zinc.  Note  that  the  shiA 
promoter responds to high concentrations of lead alone and high concentrations of zinc alone. A lead-specific
sensor can be implemented by combining this response with the zinc-specific response of zraP using
the Boolean expression (shiA) ∧ (¬zraP). Alternatively, a zinc-specific sensor can be implemented by combining
this response with the lead-specific response of ybiI using the Boolean expression (shiA) ∧ (¬ybiI).
Lead sensing gene ybiI (E. coli str. K-12 substr. MG1655)
Figure 47: Response of the ybiI promoter in E. coli MG1655 for specifically sensing lead. Note that the ybiI
promoter responds specifically and monotonically to increasing concentrations of lead alone.
Lead and zinc sensing gene yjjZ (E. coli str. K-12 substr. MG1655)
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Figure  48:  Response  of  the  yjjZ  promoter  in  E.  coli  MG1655  for  sensing  lead  and  zinc.  Note  that  the  yjjZ 
promoter  responds  monotonically  to  increasing  concentrations  of  zinc  alone  and  to  high  concentrations  of 
lead  alone.  A  lead-specific  sensor  can  be  implemented  by  combining  this  response  with  the  zinc-specific 
response of zraP using the Boolean expression (yjjZ) ∧ (¬zraP). Alternatively, a zinc-specific sensor can be
implemented by combining this response with the lead-specific response of ybiI using the Boolean expression
(yjjZ) ∧ (¬ybiI).
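The lead/zinc expressions used in the captions for Figures 45, 46 and 48 share one pattern: a broad promoter (mntH, shiA or yjjZ) is combined with the negation of a specific one (zraP or ybiI). The sketch below enumerates the full truth table for that pattern. The function names are hypothetical, and the inputs are idealized on/off promoter states rather than measured data.

```python
from itertools import product
from typing import List, Tuple


def lead_call(broad: bool, zraP: bool) -> bool:
    """(broad) AND (NOT zraP): broad promoter on, zinc-specific promoter off."""
    return broad and not zraP


def zinc_call(broad: bool, ybiI: bool) -> bool:
    """(broad) AND (NOT ybiI): broad promoter on, lead-specific promoter off."""
    return broad and not ybiI


def truth_table() -> List[Tuple[bool, bool, bool, bool, bool]]:
    """Rows of (broad, zraP, ybiI, lead_call, zinc_call) for all 8 input states."""
    rows = []
    for broad, zraP, ybiI in product([False, True], repeat=3):
        rows.append((broad, zraP, ybiI, lead_call(broad, zraP), zinc_call(broad, ybiI)))
    return rows


if __name__ == "__main__":
    print("broad zraP  ybiI  lead  zinc")
    for row in truth_table():
        print(" ".join(f"{str(value):5}" for value in row))
```

In this idealized table, the state in which the broad promoter is on and both specific promoters are off satisfies both expressions, so the two calls are unambiguous only when one of the specific promoters is also induced.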
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Malathion  sensing  gene  nemR  (E.  coli  str.  K-12  substr.  MG1655) 
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Figure  49:  Response  of  the  nemR  promoter  in  E.  coli  MG1655  for  specifically  sensing  malathion.  Note  that 
the  nemR  promoter  responds  to  high  concentrations  of  malathion  alone. 
Zinc sensing gene zraP (E. coli str. K-12 substr. MG1655)


[Figure panels; x-axis: Experimental condition. Legend: AsLow, AsHigh, Co, NiLow, NiHigh, FeLow, FeHigh, ZnLow, ZnHigh, CrLow, CrHigh, CdLow, CdHigh, HgLow, HgHigh, CuLow, CuHigh, PbLow, PbHigh, MalLow, MalHigh, Hg+Cd, As+Hg, As+Cd]


Figure  50:  Response  of  the  zraP  promoter  in  E.  coli  MG1655  for  specifically  sensing  zinc.  Note  that  the 
zraP  promoter  responds  specifically  and  monotonically  to  increasing  concentrations  of  zinc  alone. 
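The two properties claimed in this caption, a monotonic response and specificity to a single toxin, could be checked programmatically along the following lines. This is a sketch under stated assumptions: the fold-change values, the baseline of 1.0, the cutoff of 2.0 and the function names are all hypothetical. Only the condition labels (ZnLow, ZnHigh, and so on) follow the figure legend.

```python
from typing import Dict, Sequence


def is_monotonic_increasing(levels: Sequence[float]) -> bool:
    """True if each successive level is strictly higher than the one before."""
    return all(later > earlier for earlier, later in zip(levels, levels[1:]))


def is_specific(fold_change: Dict[str, float], toxin: str, threshold: float = 2.0) -> bool:
    """True if at least one condition exceeds the cutoff and every condition
    that does so is a single-toxin condition for `toxin` (e.g. "ZnLow")."""
    responders = {name for name, fc in fold_change.items() if fc >= threshold}
    return bool(responders) and all(name.startswith(toxin) for name in responders)


if __name__ == "__main__":
    # Made-up mean fold changes for a zinc-responsive promoter (illustration only).
    example = {"ZnLow": 3.0, "ZnHigh": 8.0, "PbLow": 1.1, "PbHigh": 1.2,
               "CuHigh": 0.9, "As+Cd": 1.0}
    baseline = 1.0  # assumed fold change with no toxin present
    print(is_monotonic_increasing([baseline, example["ZnLow"], example["ZnHigh"]]))
    print(is_specific(example, "Zn"))
```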